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ABSTRACT 


This thesis investigates possible relationships between the amount of 
Architect/Engineer firm (A/E) liability and A/E costs and/or construction 
costs. A/E's provide plans and specifications for Government construction 
projects. Any changes which occur during the course of construction can 
lead to A/E liability when the corrective work includes unproductive costs. 
The data for this thesis includes the A/E design costs, construction costs, 
and the amount of A/E liability for each project. The projects were 
Classified as not complex, complex, and very complex based on the 
construction characteristics of each project. Regression analysis was used 
In the search for a predictive mathematical equation. The best equation 
explained only 29.0% of the variation in A/E liability. Consequently, this 
Study did not establish any significant relationship between the amount of 
A/E liability and A/E costs and/or construction ccests. The background 
research did indicate that the A/E should have a much greater involvement 
IN aproject during construction. It is recommended that the project A/E be 


required to make site visits at least weekly. 
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|. INTRODUCTION TO ARCHITECT/ENGINEER LIABILITY 


A. INTRODUCTION 

The Government has tasked the military construction project manager 
with the duty of documenting design deficiencies and attempting to recover 
the additional costs incurred as aresult of the design deficiency. Errors or 
omissions In the design lead to additional costs to the Government in the 
form of change orders to the construction contract. When these design 
deficiencies result from negligence on the part of the designer, the 
Architect/Engineer firm (A/E) that designed the project for the Government 
may be liable for these additional costs. Historically, there has been little 
or no sucess in recovery of these costs from the A/E. 

This study was done with the underlying purpose of finding ways of 
Improving the recovery of these A/E liability costs and minimizing the 
number of design errors. If a manager could predict the incidence and 
amount of A/E Niabilit on future contracts, then the manager could place 
more emphasis on minimizing the damages to all parties. As the 
Government successfully pursues A/E liability, the A/Es will improve their 
design work and eliminate many of the design deficiencies. Fewer 
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deficiencies mean less management effort in handling construction change 
orders; in other words, a “win -win’ situation. Data from over one hundred 
construction projects from 1976 to 1984 were evaluated and analyzed 
through regression analysis. No usable mathematical equation was found to 
predict the amount of A/E liability on future projects. However, the results 
indicated a higher chance of A/E liability occurrence in very complex 
projects. 

The background of A/E liability and its evolution are contained in Chapter 
!, The origins of the data for this thesis and the method of research are 
presented in Chapter i1!. The steps used in the search for a mathematical 
model predicting the amount of A/E liability are described in Chapter IV. 


Finally, the conclusions and recommendations are contained in Chapter V. 


1], WHAT 1S ARCHITECT/ENGINEER LIABILITY? 


A. INTRODUCTION 


A/ES provide plans and specifications for construction projects. Thess 


contractor. During the course of construction, changes to the plans and 
Specifications involving design errors or omissions can lead to A/E liaoriity 
The corrective work to the design problem may involve paying tne 
construction contractor for unproductive costs. (Uneroductive costs ars 
those costs which add no value to the construction project.) The costs may 
inciude rework, impact, or extended overhead costs. Rework takes place 
when the correction to the design requires work-in-place to be removed and 
then replaced in a different configuration. The construction contractor 
receives additional compensation for this rework, dut no adcitional value is 
added to the project. Impact costs relate to additional moves within a job, 
set-backs in the learning curve of the craftsmen who have been pulled off a 
job while awaiting the design change, come-backs or return trips by 
specialty trades, and crew size inefficiency. A design change during 
construction can affect the efficiency of the construction contractor's 
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scheduling. The resulting impact costs add to the project's cost, but do not 
add value. Extended overhead costs of the construction contractor include 
the overhead costs (field office, equipment, personnel, etc.) of staying 
mobilized longer than originally planned. The Critical Path Method (CPM) of 
scheduling is typically used on construction contracts. The CPM can prove 
that a corrective design extended the critical patn and caused. the 
construction contractor to stay mobilized longer than planned. The A/E is 


justly responsible f or these unproductive costs. 


B. PRIVITY OF CONTRACT 

An owner of property contracts with an A/E for design Services and then 
contracts with a construction contractor to complete the work identified on 
the plans and specifications. There typically has been no contractual 
arrangement between the construction contractor and the A/E. Parties to a 
contract have a legal right to sue each other for nonperformance if 
necessary. This right is a consequence of privity of contract, which means 
that there is a direct and immediate legal relationship between the parties 
to a contract. The construction contractor could not sue the A/E for 
damages resulting from an improper set of plans and specifications, because 
there was no legal contract between the A/E and the construction 
contractor. By the late 1950's the courts began to reevaluate this premise. 
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The Federal Courts in a California district ruled on a case in 1958. The 
construction contractor placed defective concrete in a Government bullding 


—~= « 
' 
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project, and the concrete subsequently had to be removed and replaced. [ne 
Government contracted with the A/E for the design of the building and 
supervision during construction. The construction contractor sued the A/t 
for the rework costs that resulted because of negligent supervision. Tre 
A/E's lawyers argued the case should be dismissed because their client aid 
not have a contract with the construction contractor. The court meld that 
the rule requiring privity in this type of case was no longer valid. The court 
concluded that an A/E negligent in performance may be held liable to the 
construction contractor or any third party. Tne courts reasoning was that: 
Considerations of reason and policy impel the conclusion that the position 
and authority of a supervising architect are such that he ought to labor 
under a duty to the prime contractor to supervise the project with cue 
care under the circumstances, even though his sole contractua! 
relationship is with the owner, nere the United States. Altogether too 
rauch control over the contractor necessarily rests in the hands of the 
Supervising architect for him not to be placed under a duty imposed by law 
te perform without negligence his functions as they affect the contractor 
The power of the architect to stoo the work done Is tantamount to a 
power of economic life or deatn over the contractor. {It is only just that 
such authority, exercised in such a relationship, carry commensurate lega! 
responsibility.{[Ref. 1, p. 6] 
This ruling implied that the A/E could be exposed to potential ilability from 


any professional contacts, whether a contractual relationship existed or 


not. 


The key to the above case, as well as the many that have followed, is 
that privity of contract no longer can be used as a legal defense by A/ES 
against construction contractors. Where the A/E actions or inactions lead a 
construction contractor down a path where the contractor suffers economic 
damage, and it can be shown that the A/E was negligent, the A/E is liable to 
the construction contractor for the economic damages. Negligence by the 
A/E leads to A/E liability. “One is not ‘negligent’ unless he fails to 
exercise that degree of reasonable care that would be exercised by a person 
of ordinary prudence under all the existing circumstances in view of 
probable danger of injury” [Ref. 2} A/Es can only be found negligent if they 


fail to exercise ordinary skill in their involvement in a project. 


C. ORDINARY SKILL 
A/ES are expected to use ordinary skill and care in all aspects of their 
involvement in a construction project. There are numerous types of 
deficiencies that may occur if normal or ordinary skill is not exercised. 
They include 
defects and deficiencies in the preparation of the plans and specifications; 
tne failure to properly supervise the project; the unreascnable delay by the 
architect in preparation of revised plans and specifications; unreasonable 
delay in the issuance of change orders; failure to properiy check or approve 


submittals, such as shop crawings; and the improper directives that an 
architect may give to anyone working on the project. Ref. 1, p. 8] 
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Along with these items, risk factors exist concerning the A/E liabilities for: 


a) Failure to observe traditional responsibilities to employ reasonable 


b) 


Cc» 
~~ 


qd) 


atl 


C 


f) 


judgement, diligence and care. 


Failure to deploy adequate numbers of personnel to perform work on a 
timely basis. 


Failure to fake “all measures” in counseling a client about nis options, 
or to bring about full analysis of all risks among project participanis. 


Failure to, provide all legal, code, regulatory, safety, equal employment 
opportunity or QA/QC requirements in design. 


Failure to warn or properly evaluate an owner's potential risk in 
accepting an innovation in concept or new material. 


Failure to provide adequate contract conditions for guiaance of 
Suppliers and contractors to define interface responsibilities noting 
proper accountability for same[Ref. 3, p. 7, Vol. 1] 


In any of the above events the A/E’s responsibilities are ~ to exercise the 


‘ordinary, reasonable technical skill, ability and competence’ that would be 


expected of a qualified professional in a similar situation” [Ref. 1, p. 7]. 


The failure to use due care while involved in a construction project does 


not automatically cause A/E liability. If a design error existed in a project, 


and the A/E corrected the error in such a way as to prevent damage to the 


construction contractor, there would be no liability. Even if the arrange- 


ment between the owner and the A/E specifies the highest standards of the 


profession, “the architect will only be held to the normal standard of the 


profession in actions brought by third parties for negligence” [Ref. 1, p. 46]. 


1] 


The architect's liability is tied directly to the obligations assumed under 
his contract with the owner. To the extent that the architect contracts to 
assume broader responsibility in connection with the administration of 
the construction project, he must be prepared to ciscnarge thal 
responsibility with the same care, skill, Knowledge and cispatcn [that a 
court will find an ordinary member of his prefession would exercise In the 
performance of such obligations. His failure to do so will subject him fo 
either (1) derivative liability to contractors in suits dy owners for 
indemnification for delay damages oeing sougnt by tne contractor oy virtue 
of the architect's breach of his contract with tne owner; or (2) cirect 
liability to tne contractor for delay damages resulting trom his Failure co 
nerform with do care the obligations undertaken. {Ref. |, p. 49) 


Negligence for A/Es {is tied directly to their obligations anc co tneir skit} 
and care in performing those obligations. A good definition of regiigence °s 
as FOliows: 
A person is negligent when he fails to exercise ordinary care. Ordinary 
care is the degree of care which the great mass of mankind orainarnae 
exercises under the same or similar circumstances. A rcerson fails to 
exercise ordinary care when, without intending to do any wrong, he does an 
act or omits a precaution under circumstances in whicn a persan of 
ordinary intelligence and prudence ought reasonably to foresee that such 
act or omission will subject him or his property, or the person or propert 
of another to an unreasonable risk or damage. (Ref. 4, p. 31) 
The A/ES are at risk in any contract with an owner for services per- 
taining to a construction project. [{f the construction contractor ts 
economically damaged because the A/E failed to use orainary skills, the A/E 


can be held liable to the construction contractor for those damages. What 


nappens in this situation wnen the owner is the United States Government’? 


D. THE GOVERNMENT AS THE OWNER 

For most Government construction projects an A/E provides the design. 
This usually includes the plans and specifications, as well as follow-on 
Sep ecs 0) CONStNUCtION SILe VISILS, design error or omission corrections, 
and submittal review. Submittals are product descriptions of the equipment 
the ere truettion contractor intends to install. These submittals are sent 
to the A/E for review to insure that the equipment complies with the 
contract specifications. 

The Congress of the United States authorizes and appropriates the 
money for Military Construction projects. Organizations such as Western 
Division, Naval Facilities Engineering Command (WESTDIV) will then 
contract with an A/E firm for design services. The completed plans and 
Specifications are then advertized and awarded, in accordance with the 
Federal Acquisition Regulations, to the low responsive responsible bidder 
ror construction. The owner's representative in the field is the Resident 
Officer in Charge of Construction (ROICC), who has day to day inspection 
responsibilities. The ROICC submits routine correspondence, submittals, 
and requests for information from the construction contractor to the A/E as 
wel] as to WESTDIV. It is the ROICC’s responsibility to take steps necessary 


to Insure a timely response for this information. 


(a! 


The request for information and correspondence often points out design 
deficiencies (errors or omissions) in the plans and specifications. [he 
correction of these deficiences usualiy causes economic Gamage to -fis 
construction contractor. The Government then must pay the construction 
contractor for the amount of the damages incurred. In the past the 
Government rarely pursued remuneration for these damages caused by the 

A/E. The Government just did not hold the A/E liable for their actions. 


On July 14, 1977, the Comptrolier General of the United States, ina - 
fo Congress, advised the Congress that the agencies of the Feder 
Government have been required to pay to construction contractors an 
amounts of money because of negligent errors and/or omissions In tne 
plans and specifications prepared by A/ES engaged by tne Government. Tne 
report states that the agencies have not made a serious effort Co document 
the design deficiences (to describe the deficiency and the extent agua 
cnaracter of the A/E’s Involvement and responsibility) althought required 
to do so by regulations and procedures established for such agencies. 
secondly, that the agencies were not attempting to reccver from the A/E 
fhe additional cost incurred by the Government as aresult of these design 
dericiencies. Finally, the agencies have not been evaluating the A/E’s 
performance properiy and have not coordinated the information regarding 
the A/E's performance and have not disseminated the information to the 
other agencies empicying A/E's. | Ref. 5, p. 1] 


The above GAO report has lead to several changes in the way the Naval 
racilities Engineering Command (NAVFAC) does business. The NAVFAC 
Contracting Manual contains the following discussion: 

a. For the ourpose of these criteria, a design deficiency is any deficiency 
in the designs, crawings and soecifications that may result in the 


Governmen: incurring damages. A-E liadiiity fora design deficiency may 
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result from either an A-E’s negligent failure to meet the standard of 
care reasonably associated with the A-E profession or its breach of a 
contractual duty of skill and care in performing design services. The 
determination of A-E liability requires answering afffirmatively the 
Following questions: 


(1) Is the construction modification attributable to a cesign deficiency’? 


(2) Does the design deficiency stem from an error or omission by [ns 
ATE”? 


(3) Does the design deficiency by the A-E result from tne A-E 5 reghigent 
failure to meet the becttaee of care reasonably associated with the 
A-E profession or from a creach of contractual culty’ 


(4) Has the Government suffered damage as a result of the desi 
Geficiency’? 


we) 
t 

ay 
~4 


b. irrespective of whether A-E flabilify is pursued under a fort : 
contractual theory, tne professional standard of care [0 be applied 's ine 
Same standard used in cases involving alleged malpractice of doctors, 
lawyers and other professionals who sel] their services for compensation. 
This standard essentially requires an A-E fo exercise sucn reasonabie 

care, skill and diligence as one in that profession would ordinartiy 
exercise under similar circumstances. Failure to exercise normal ski!! and 

competence is usually negligence. Howerver, consideration should be given 
to representations of special skills and abilities made Dy the A-E anda 
relied upon by tne Navy in the selection progress. If negligence is believed 
to be involved, documentation should clearly state so. An A-E'S 
performance musf be negligent in order for tne Government to collect for 
damages.{Ref. 6, p. J-2] 


Once again, the A/E failure to exercise ordinary skill and competence 
leads to negligence. The economic damage to the Government trem a design 
deficiency must be evaluated. The construction contractor is entitled to all 
valid costs incurred as a result of a change order to the plans and 
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specifications. The Government is responsible for paying those costs ‘for 
additional work which would have increased the original old had the 
additional work been reflected in the original contract documents” [Ref. 5, p. 
P21 The cost for this additional work is considered value added. However, 
unproductive costs associated with the acditional work Gor which the 
construction contractcr is aiso entitied to compensation) are conside! Seams 
the A/E liability category. The unproductive A/E liability costs are 
generally limited to tne rollowing: 


(a) Tear out and reolacement costs: 


~ 


(Db) Restocking and rehandliag- ef smatersals delivered out 
Pogiaaec: 


“a, XN 
GS loner 


(c) Delay costs ( extended overhead and impact costs); 
(d) Government investigation and design costs; 

(e) A/E field visits costs; 

(f) Input costs on unitemized work. [Ref. 5, p, 2] 

Normally the A/E will not pay any of these costs at the time of the 
change. This means that the Government must pey the construction 
contractor for not only the value added portion of the change but also all of 
tne costs associated with A/E lialility. The Government must then pursue 


remuneration for the A/E iability. 


Ill. METHOD OF RESEARCH 


A. INTRODUCTION 

Western Division, Naval Facilities Engineering Command provided the 
data source for this study. WESTDIV’s primary mission includes the design, 
construction and maintenance of public works, public utililies, special 
Facilities, public housing and vehicle management. WESTDIV, located in San 
Bruno, California, provides support to United States Navy activities and 
some other Federal Agency activities in the western part of the United 
States and Alaska. An extensive information data base provides 
documentation of all WESTDIV awarded A/E contracts and construction 
contracts. From this data base the following information was obtained for 
each project: the A/E contract number; the A/E’s name; the design costs; 
the Construction Contract Support Service costs; the amount of A/E 
lability, if any; the settlement amount; the construction contract number; 
the contractor's name; the award amount; the construction modification 
amount; and the type of facility. These data were used to study 
relationships leading to an A/E ‘lability prediction mode}. Assuming a 
significant relationship existed, the incidence and amount of A/E liability 
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could be predicted. Management ccould use this information to predict the 
incidence of A/E liability. | Management would then be more alert for 


opportunities to minimize design differences. 


B. THE SAMPLE 

The primary data source came from the “A-E liability processing status 
log” (the log) of 1 July 1986. The log was composed of about 110 projects, 
from 1976 to 1984, which contained at least one design coded change order. 
A project with a construction contract change order resulting from a design 
error or omission is considered a design coded change. The log included the 
A/E contract number and the Construction Contract number, the title and 
location of the project, the A/E Responsibility Board meeting action date, 
the date a letter was sent to the A/E requesting settlement, the amount of 
potential liability, the status of processing, and the settlement received. 
The A/E Responsibility Board (the board) is composed of members from the 
design, construction, and contracts departments, a legal counsel, and a 
recorder, and is chaired by the head of the acquisition department. The 
board “prepares final determination to pursue or not pursue 2 Hfability...” 
(Ref. 7, p. 5}. 

The A/E and construction contract numbers from the log were used to 
create two more files from the data base. The first file, the A/E costs, 
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contained the total A/E costs associated with each project. These costs 
included the initial award amount, any modifications to the contract during 
project design, and Construction Contract Support Services (CCSS), CCSS is 
for follow-on services such as construction site visits, submittal reviews, 
etc. The second file, the construction contractor costs, provided tne total 
cost paid to the construction contractor for each.project. The costs 
included the initial award amount plus any modifications to the contract. 
The three sources of data were then compiled for a total usable data 
base of 91 projects. For a few projects the contract numbers on the Log 
were not listed correctly. This led to no data or erroneous data on the A/E 
Sooe or ine CORMStruction Contract cost files. There were also a few 
inadvertent omissions or errors in entering the contract number into the 
data base to obtain the cost files. Only those projects which had the data 


from all three sources were used, 


C. LIABILITY 

When an apparent design deficiency occurs during the construction of a 
project, the ROICC is required to submit a Determination and Findings (D&F) 
Co WESTDIV. "A D&F is a written statement of relevant facts, 
determinations, findings, and recommendations pertinent to change orders 
involving potential A-E liability” [Ref. 8, p. 3]. This D&F is reviewed by 
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Several people within WESTDIV. These include the Engineer in Charge of the 
project, the head of the Design Department, the head of the Construction 
Department, and the A/E Responsibility Board. All of the above people 
make recommendations to pursue or not pursue A/E liability. In fact, most 
of the D&Fs do not lead to A/E liability. Where the head of the Construction 
Department and neac of Design Department agree not to pursue A/E liability, 
the D&F is not forwarded to the board. 

This study incluced only those projects wnere a D&F was forwarded to 
the poarc. If the board recommended no liability, then comciled researca 
data (Appendix A) snow no liability for the particular project. ‘when the 
Doard recommended pursuing liability, the estimated potential lability 
amount is shown in Appendix A. The board's determination on A/E liability 
was considered the most appropriate place to first consider a project as 
containing A/E liability. When the board recommends pursuing A/E liability, 
a letter so stating is forwarded to the A/E. During this study, efforts were 
taken to prove a relationship existed between the incident of A/E liability 


and job complexity. 


D. JOB COMPLEXITY 
The job complexity was broken into three category codes, not complex 
(NC), complex (C), and very complex (VC). These codes were assigned on the 
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basis of the following list of construction project characteristics: 
Number Weight Characteristic 


ig Additions to existing structures, mechanical systems, 
PlCCLRIGAisCVSTEMS a CLC. 


ee | Multi-story construction 

@ Very specialized construction (¢.g., missiie nagazines? 
4. Very hazardous construction 

=) opecial security requirements 

oF Exceptional unforseen concitions 


The project was considered not complex if from zero to one of the above 
Ge) acteristics was present, complex if two to (hree were oresent, and very 
complex if four or more were present. Equal weight was given to all of the 
characteristics, with the exception of the very specialized construction, 
which was given a double weight. The nature of specialized construction 
alone was considered enough to increase the complexity of a construction 
project at least into tne complex range. The complexity codes were 
assigned to the 91 projects studied in an objective manner based on the 
presence or absence of the six construction project characteristics. The 


complexity codes were then used to establish the analysis groups for the 


data collected. {[Ref. 9, p. B-IX! 


F. ANALYSIS 

The three complexity codes (not complex, complex and very complex) 
determined in which group each contract was to be analyzed. For example, 
in the not complex group two different regression models were set up. 
First, {ne amount of liability was regressed against the A/E costs and tnen 
the amount of liability was regressed against the construction costs. 
regressions were done for all tnree complexity codes fo determine existing 
correlation patterns. The regression analysis was then performed on the 
combined data from the three complexity groups. The actual analysis 


orecedures and results are described inthe next chapter. 


ae 
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IV. ANALYSIS 


A. INTRODUCTION 

The primary goal of analyzing the 91 projects in Appendix A was to 
establish a relationship to predict the amount of A/E liability on future 
contracts. Regression was used to show the relationship. “Regression is a 
technique of quantifying relationshios between variables” [Ref. 10, p. | 
Chap.2]. The regression analysis was done with a Minitab computer 
software program on the Naval Postcraduate School's IBM 3033 computer. 
First the data from Appendix A were compiled into three groups - not 
complex, complex, and very complex projects. Then, in each group, A/E 
liability (column *8 in Appendix A) was regressed against the A/E costs 
(column *4 in Appendix A), against construction costs (column *6 in 
Appendix A), and finally against both A/E costs and construction costs in a 
multiple regression analysis. Two explanatory variables, A/E costs and 
construction costs, were used because they could be reliably measured. The 
explanatory variables are used in an equation to predict the dependent 


variable, A/E liability in this case. 
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For each project, the award amount for the design contract and all 

contract modification costs are included in the A/E costs column (co! 
#3 in Appendix A). These costs are known with certainty at the time of 
award of the construction contract. The project design is complete and al! 
of the costs for the desicn Mave seen paid Althoucn the consi 
contract sup services (CCSS) costs are negotiated’ wily ins Ayam 
about the same time as tne construction contract award, they may be 
amended during tne course of the construction. The historical data for the 
9! protects studied inclucee the CCS5 costs. At the time cf award Gt aie 
copstructicn comtract, the CCSS coses would et be Known wilt ¢=aae 
Therefore, the CCSS costs could skew the resuits of a prediction model. The 
CCS5 costs of the 91 projects were removed from the A/E design costs anc 
not included in this study (the data in columns #3 and #4 in Appendix A did 
NOU MeIMCes CSS Costs): 

The construction costs (columns #35 and #6 in Appendix A) contain only 
the amount originally awarded for the contract. [nis figure 1s Known with 
certainty at the time of award. Although the construction modification 
costs were known for the 91 projects studied, only an estimate could be 
used for a new project. The uncertainty of the modification costs coula 
Skew the results of a prediction madel. therefore, the modification costs 
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of the GI projects were deducted from the construction costs ana not 
included in this study. 

Prior to running any analysis, the data were adjusted for the effects o 
inflation. Inflation can cause contamination of the data and skew or 
invaliGate the rearession model. “fhe data should be expressed in constant 
dollars to avoid contamination Dy changes in the price levels over time’ 
Ref. 10, p. 31, Chap. 2]. The inflation adjustment is made by using the New 
Construction Index which is obtained by studying the variations in costs 
from year to year of similar construction projects. The differences tn costs 
are due Co inflation or deflation. A base year is arbitrariiy picked and set 
equal to 100. Dividing 100 by the index number for another year provides ; 
correction factor for that year. When multiplied by the correction factor, 
tne construction costs for that year can then be compared to the base year 
without effects of inflation. The New Construction Index used 1977 as the 
base year [Ref. |! and 12]. The index and the correcticn factor were 
displayed in Appendix 8 for the years 1975 through 1984 The cata in 
Appendix A were adjusted to 1977 constant dollars. Columns #4, 6, 8, and 
1Q of Appendix A contain the inflation adjustea costs - #4, adjusted A/E 
costs; *6, adjusted construction costs; *8, adjusted A/E liability; and 


*10, adjusted amount of liability settlement. The years of the contracts 
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for the design and construction of the project are listed in column *1 in 
Appendix A. When the year for the design and the construction is the same, 
only one year is listed. 

As stated above, the explanatory variables are Known with certainly, not 
only for the the 91 projects studied but 5 for any future project at the 
time of construction contract award. Now all that is needed is a 
mathematical equation to predict the amount of A/E liability on future 


contracts by reference to the values of the known variables. 


B. REGRESSION ANALYSIS OF NOT COMPLEX PROJECTS 

There were 34 projects in the not complex (NC) category. Appendix C 
contains the data and all of the regression analysis results. The A/E and 
construction cost figures in Appendix A were recorded to the nearest 
thousand dollar level, so after the data from the 34 NC projects were 
entered into the computer the A/E costs and the construction costs were 
both multiplied by 1000. There are five columns of data in Appendix C. They 
are Cl, anumerical listing of the projects; C2, the A/E costs in thousands; 
C3, construction costs in thousands; C4, A/E liability; C5, the A/E costs; 
and C6, the construction costs. The first regression of A/E liability (C4) 
against the A/E costs (C5) was performed. The equation for each regres- 
Sion is provided in the Appendix. In this case the equation is: A/E liability 
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(C4) = 21366 + 0.0257 x A/E costs{C5). This equation has a coefficient of 
determination (R2) of only 49%. This means that 49% of the sample 
variation in A/E liability, the dependent variable, can be explained by the 
change in A/E costs, the independent variable. [Ref. 10, p. 13, Chap. 2] An 
RZ of 4.9% means the regression equation Is a very poor predictor. 

From nere on only the R* value will be discussed; the equations will not 
be included in the body of this paper unless the regression equation is 
Significant, as determined by R4 and other factors. Appendix C contains all 
of the equations. A histodram of A/E liability costs (C4) was plotted to 
determine if anormal distribution of data existed. (A histogram is a graph 
of a frequency distribution in the form of a series of rectangles, each 
proportional in width to the range of values within a class and proportional 
in height to the number of items falling in the class.) The midpoint of the 
range and the count of the number of items falling in each range, along with 
a graphical presentation of the distribution, are included in Appendix C. 
Regression models are based on ".. .the assumption that the values of the 
dependent variable are normally distributed’ (Ref. 10, p. 3, Chap. 4]. The 
data for the dependent variable, A/E liability costs (C4), were not normally 
distributed The data can be transformed in several ways, however, to 
obtain a more normal distribution. The logarithmic, square root and 
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reciprocal functions were the only ones used during this study. For 
instance, the scuare root of each dependent data point of A/E liability (C4) 
was taken and then plotted on a histogram. The square root of A/E liability 


(C10) was the new variable. The histogram of the square root of A/E 
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liability (C10} was more normally distributed than the nistegram oF A/e 
Nability (C4). The square root of A/é tlapility (ClO) was then regressed 
against A/E costs (CS) for an R< of 6.6%. R* values of less then 30% for 
the regression moael indicate a very poor predictor. [herefore, other 
regressions were tried. 

The square root of A/E jiabtlity (C19) -was then recressec against ins 
CONSETUCTION COSTS 4UG) TO  Croguceran Qe of 0.9%. The A/E liability (C4) 
was then regressed against the construction costs (C6) to see if a better 
model could be obtained without the transformation. R* was 1.7%. The 
dependent variable may also be transformed. First the histograms of A/E 
costs (CS) and construction costs (C6) were plotted to determine that the 
data were not normally distributed for either. Three transformations of A/E 
costs (C5) were then tried (the reciprocal, the logarithm, and the square 
root) and the associated histograms were plotted. The square root of A/t 
costs (C13) produced the most normai distribution. Three transformations 
of the construction costs (C6) were tried (the reciprocal, the logarithm, 
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and the square root) and the associated histograms were plotted. The 
Square root of construction costs (Cl7) produced the most normal 
distribution. The square root of A/E liability (C10) was regressed against 
the square root of A/E costs (C13) for an RZ of 5.0%. The square root of 
A/E liability (C10) was regressed against the square root of construction 
costs (C17) for an R* of 5.9%. 

Next, a multiple regression was performed. The square root of A/E 
lability (ClO) was regressed against both the square root of A/E costs 
(C13) and the square root of construction costs (C17), producing an R= of 
6.8%. Not only was the R~ value very 'ow, but the t-ratio values of 0.54 for 
Ciemegvene fneOl Of A/E costs (Cl3) and 0.7/7 for the square root of 
construction costs (C17) were also too low. The t-ratio provides a test for 
the statistical significance of the regression line. “A high t-value indicates 
that the incependent variable is important in explaining the value of Y. 
Generally, t-values of greater than two are desired..."[Ref. 10, p. 17, Chao. 11. 

AL this point in the analysis no significant relationship had been found, 
SO aplot of the A/E liability (C4) against the A/E costs (C5) and a plot of 
A/E liability (C4) against construction costs (CS) were reviewed for out- 
liers. Data points which are far beyond the main body of data are considered 
outliers; outliers can skew the regression analysis and should be eliminated. 
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During the regression analysis the computer program notes unusual 
observations. These unusual observations in this study included data points, 
by number, with a large influence on the regression equation. Outlier data 
points #9, 14 and 25 are circled in Appendix C on the plot of A/E liability 
(C4) against A/E costs (C5). Outlier data point *8 is circled in Appendix C 
on the plot of A/E liability (C4) against construction costs (C6). These four 
data points were all outside the main body of the data because of extremely 
high construction costs or A/E design costs relative to the other NC data 
points. These four data points were then deleted from the NC data. The NC 
data were then printed showing only 30 rows of data elements. Note that 
column Ci contains the original numerical listing for the project. The same 
process was used to evaluate the revised NC data. First the square root of 
A/E Htability (C20) transform was calculated and its histogram was plotted. 
The regression of A/E liability (C4) against A/E costs (C5) produced an Re 
of 10.9% and the regression of the square root of A/E liability (C20) against 
A/E costs (C5) resulted in an R* of 12.1%. Next, the A/E liability (C4) was 
regressed against the construction costs (C5) and generated an R of 26.2%. 
The regression of the square root of A/E liability (C20) against the 
construction costs (C6) produced an Ré of 210%. These values of Re are 
much better than what was obtained before deleting the outliers. However, 
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the values are still very low. Some additional steps were taken Co try for 
Imoroved RZ Values. 

The square root of A/E costs (C21) was calculated and the histograms 
for A/E costs (C5) and the square root of A/E costs (C21) were plotted, and 
rey SNOW thal the squere root of A/E costs (C21) is more normaly 
distributed. Regressing the square root of A/E liability (C2Q) against tne 
square root of A/E costs (C21) resulted in an R¢ of onl ¥ 3.8%. Tne square 
root of construction costs (C22) was calculated and the histograms for 


construction costs (C6) and the square root of construction cost 


w) 


‘fe thee ee 


fees 1. ec, naicaling that the square root of construction casts (C22) *< 
more normally distributed. The square root of A/E liability (C20) was then 
regressed against the square root of construction costs (C22), producing an 
R¢ of 20.9%. Once again the plots of A/E liability (C4) against construction 
costs (C6) and A/E liability (C4) against A/E costs (C5) were reviewed for 
outliers. Outlier data points #5, 16 and 20 were circled on the plots and 
then deleted from the NC data. 

After the revised data were printed, the square root of A/E liability 
(C30) was calculated and its histogram plotted. The regression of A/E 
liability (C4) against A/E costs (C5) resulted in an R* of 29.0%, while the 
regression of the square root of A/E liability (C30) against A/E costs (C5) 
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produced an R2 of 26.9%. The Square root of A/E costs (C31) and its 
histogram were calculated and plotted. The regression of the square root 
of A/E liability (C30) against the square root of A/E costs (C3!) generated 
an R¢ of 19.0%. An R¢ value of 28.5% resulted from the regression of A/E 
liability (C4) against the construction costs (C5). [ne histogram cf tne 
construction costs (C6) was olotted and compared fo the foliewing 
transformations: the square root of construction costs (C32) and tne 
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logarithm of construction costs (C33). None of these histograms was very 
normally distributed; however, the logarithm of construction costs (C33) 
was the closest. Tne A/E liability (C4) was regressed against tre fogaritnm 
OfACONSERUGEION COSTSINE SS) ares uiEinaa teas R< of 16.1%. The regression 
the square root of A/E liability (C30) against the logarithm of construction 
costs (C33) produced an R* of 16.3%. The multiple regression of A/E 
liability (C4) against both A/E costs (CS) and construction costs (C6) 
produced an QZ of 32.9%. However, the t-ratios for both independent 
variables were too low and the equation should not be used. Another 
multiple regression was tried - A/E liability (C4) against beth the square 
root of A/E costs (‘C3}) and the square root of construction costs (C32). 
This produced an R* of 24.4%. The regression of A/E liability (C4) against 


the square root of construction costs (C32) resulted in an R¢ of 22.1%, 
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while the regression of the square root of A/E costs (C30) against the 
Square root of construction costs (C32) produced an R< of 17.6%. 

From this point in the analysis the plots of A/E liability (C4) against A/E 
costs (CS) and AZE liability against (C4) construction costs (C6) were again 
reviewed. Twomore outliers were deleted from the data and the regression 
analysis was started over again. The resulting regressions, using both . 
nontransformed and transformed data, generated R¢ values of less than 13%, 
less than those produced before deleting the last two outliers. One more 
data point was deleted in an effort to improve the regression equations. 
Once again, the resulting R¢ values were very low. No more data points 
were then deleted. The regression equations were not improving with the 
deletion of more data and the data base was getting too small to be 
representative of the initial data. 

After deleting seven data elements the best result was an RS of 29.0%. 
The regression equation is: A/E liability (C4) = -4741 + 0.0898 x A/E costs 
(CS). The t-ratio for the independent variable was 3.19. This means that 
29.0% of the variation in A/E liability can be explained by the change in A/E 
costs. Even though this was the best relationship for A/E liability, the 
regression equation does not provide enough of a relationship to be useful in 


predicting AZE liability. 


C. REGRESSION ANALYSIS OF COMPLEX PROJECTS 

There were 46 projects inthe complex category from the data in Appen- 
dix A. Appendix D contains the data and all of the regression analysis. The 
A/E costs and the construction costs were each multiplied by one thousand 
so that all of the data were in the same units. The five columns in Appendix 
D contain the same types of data as the five columns in Appendix C. The 
first two regressions run were A/E liability (C4) against A/E costs (C5), 
producing an R+ of 0.0%, and A/E liability (C4) against construction costs 
(C6), resulting in an R of 1.0%. The Nistogram of A/E liability (C4) was 
plotted and then compared to the histogram of the transformation of the 
Square root of A/E liability (C10). The square root of A/E liability (C10) 
was regressed against A/E costs (C5), resulting in an R* of 0.4%. Then the 
square root of A/E liability (C10) was regressed against construction costs 
(C6), producing an R* of 0.8%. The histograms of A/E costs (CS) and 
construction costs (C6) were then plotted and compared to several! 
transforms. The reciprocal, logarithm, and square root transforms were 
tried on the A/E costs (C5); the resulting histograms indicated that the 
logarithm of A/E costs (12) was the most normal. The logarithm and the 
Square root transforms were then tried on the construction costs (C6); the 
respective histograms indicated that the square root of construction costs 
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(CIS) was the most normal. A/E liability (C4) was then regressed against 
the logarithm of A/E costs (C12), producing an R¢ of 2.9%. The square root 
of A/E liability (C10) was regressed against the logarithm of A/E costs 
(C12), resulting in an R* of 5.8%. Two more feCieootonc vere Cen 
performed, The first was A/E Jiability (C4) against the square root of 
@enstruction costs (C15), pegureny in an R¢ of 1.9%. The second was the 
Square roct of A/E liability (CIO) against the square root of construction 
costs (Ci5), producing an Re Tee ee we itielere regression cf ine sGuers 
root of A/E liability (C1O) against the logarithm of A/E costs (Ci 2} and tne 
Square root of construction cosis (C15) generated an Re of 5.2%. None of tne 
R values was Significant, so tne plots of A/E liability (C4) against A/E 
costs (CS) and A/E liability (C4) against construction costs (C6) were 
reviewed for outliers. Data elements * 4, 38 and 40 are outliers and Nave 
been circled on the plots in Appendix D. 

Those three data elements were then deleted from the set of complex 
data, and the revised data were printed in Appendix D. The A/E liability (C4) 
was transformed into the square root of A/E liability (C20) and both 
histograms were plotted. The square root of A/E liability (C20) was more 
normally distributed. Four regressions were Chen run fo check for a good 
model. The A/E liability (C4) was regressed against A/E costs (C5), the 


os 


Square root of A/E liability (C20) was regressed against A/E costs (C5), 
A/E liability (C4) was regressed against construction costs (C&S), and the 
Square root of A/E liadility (C20) was regressed against construction costs 
(C6). These four regressions generated respective Re Values of 5.4%, 6.2%, 
0.9%, and 0.7%. The A/E costs (C5) were then transformed into the square 

root of A/E costs (C21) and the logarithm of A/E costs (C22). The 
histograms of all three were plotted, and indicated that the square root of 
A/E costs (C21) was more normal. The construction costs (C6) weneiiga 


transformed into the square rcot of construction costs (C25), and tne 


) 


mistogram iundicated this was a normal distribution. ine tollowing five 
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regressions were thenrun: A/E liability (C4) against the square root of A/E 
costs (C21), the square root of A/E liability (C20) against the square root 
of A/ZE costs (C21), A/E liability (C4) against the square root of 
construction costs (C23), the square root of A/E liability (C20) against the 
square root of construction costs (C23), and finally, the multiple regression 
of the square root of A/E liability (C20) against both the square root of A/E 
costs (C21) and the square root of construction costs (C23). The respective 
RZ values of these five regressions were /.2%, 8.5%, 3.0%, 313%, anG@aeom 
None of the R2 values was Significant, so the plots of A/E liability (C4) 
against A/E costs (CS) and A/E liability (C4) against construction costs 
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(C6) were reviewed for outliers. Data elements # 4, 11 and 33 are outlier 
and have been circled on the plots in Appendix D. 

The three outliers were then deleted from the set of complex data, and 
the revised data were printed in Appendix D. The A/E liability (C4) was 
transformed into the square root of A/E liability (C30), and both histograms 
were plotted. The square root of A/E lability (C30) was more normal'y 
distributed. Four regressions were then run to check for a good model. Tne 
A/E liability (C4) was regressed against A/E costs (CS), the square root or 
A/E liability (C50) was regressed against A/E costs (C5), A/E liability (C4) 
was regressed against construction costs (C6), and the oe root of A/E 
ability (C50) was regressed against construction costs (C6). These 
regressions produced R¢ values of 1.6%, 3.6%, 8.4%, and 7.3%, respectively. 

The A/E costs (CS) were then transformed into the square root of A/E 
costs (C31) and both histograms were plotted, indicating that the scuare 
root of A/E costs (C31) was a little more normally distributed. The 
construction costs (Co) were transformed into the square root of 
construction costs (C32) and the logarithm of construction costs (C33). 
The three associated histograms were plotted, indicating that the 
construction costs (C6) were the most normal. Regressions against the 
Square root of construction costs (C32) and the logarithm of construction 
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| costs (C33) were not run because the constmMiGtion costs (G6) wae amiG—. 
normally distributed. The following three regressions were run: A/E 
liability (C4) against the square root of A/E costs (C31), the square root of 
A/E liability (C30) against the square root of A/E costs (C31), and the 
multiple regression of the square rceot of A/E liability (C30) against beth 
the square root of A/E costs (C5!) and the construction costs (C6). These 
tests resulted in respective R¢ values of 42%, 6.9%, and 8.8%. The Ré value 
of 8.8% was disregarded because of the very low t-ratio values of 0.79 for 
the square root of A/E costs (C3!) and 0.90 for the construction costs (Cé&)., 
None of the R+ values was Significant, so the plots of A/E liaoility (C4) 
against A/E costs (CS) and A/E liability (C4) against construction costs 
(C6) were reviewed for outliers. However, no more data points were 
deletec. The regression equations were not improving with the deletion of 
more data, and the data base was becoming too small to be representative of 
the initial data. 

The regression of the square root of A/E liability (C20) against the 
square root cf A/E costs (C21), after deleting three data elenienimey 
produced the best R¢ value of 8.5%. The regression equation Is: the square 
root of AZE liability (C20) = 45.5 + 0.119 x the square root of A/E costs 
(C21). The t-ratio for the independent variable was 195. This means that 
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8.5% of the sample variation in the square root of A/E liability can be 
explained by the change in the square root of A/E costs. Even though this 
was the best relationship for A/E liability, the regression equation does not 


_ provide enough of arelationship to be useful in predicting A/E liability. 


D. REGRESSION ANALYSIS OF VERY COMPLEX PROJECTS 

; There were |{ projects in the very compiex category from the data in 
Appendix A. Aopendix E contains the data and all of the reqression analyses. 
The A/E costs and the construction costs were each multiplied by one 
thousand, so all of the data were in the same units. The five columns in 
Appendix & contain the same types of data as the five columns in Appendices 
C and D. The first two regressions run were A/E liability (C4) against A/E 
costs (C5), with an R2 of 3.5%, and A/E liability (C4) against construction 
costs (C6), resulting in an R* of 0.0%. The histogram of A/E liability (C4) 
was plotted and then compared to the histogram of the transformation of 
the square root of A/E liability (C10). The square root of A/E liability 
(C10) was regressed against A/E costs (C5), resulting in an R* of 5.4%. 
Then the square root of A/E liability (C10) was regressed against 
construction costs (C6), producing an R2 of 0.5%. The histograms of A/E 
costs (CS) and construction costs (C6) were then plotted and compared to 
several transforms. The square root, logarithm, and reciprocal transforms 
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were tried on the A/E costs (CS); the resulting histograms indicated the 
Square root of A/E costs (11) was the most normal. The square root and the 
logarithm transforms were tried on the construction costs (C6); the 
respective histograms indicated tnat the logarithm of construction costs 
(C15) was the most normal. A/E liability (C4) was then regressed against 
the square root of A/E costs (C11), producing an R¢ of 2.6%, Tne Square #oee 
of A/E liability (C10) was regressed against the square roct of A/E costs 
(Cli), resulting in an R+ of 42% Two more regressions Were item 
performed. Tne first was A/E liability (C4) against tne Icaarithm of 
construction costs (C15), resulting in an R< of 1.5%. The secone was tne 
Square root of A/E liability (C10) against the logarithm of construction 
costs (C15), producing an RZ of 0.2%. A multiple regression of the square 
root of A/E liability (C1Q) against the A/E costs (C4) and the construction 
costs (C6) generated an R* of 5.8%. None cf the R* values was Slanificant, 
So the plots of A/E liability (C4) against A/E costs (CS) and A/E liability 
(C4) against construction costs (C6) were reviewed for outliers. Data 
element *3 is an outlier and has been circled on the piots in Appendix E. 
Data element *3 was then deleted from the set of very complex data, and 
the revised data were printed in Appendix E—. The A/E liability (C4) was 
transformed into the square root of A/E liability (C20) and both histograms 
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were plotted, indicating that the square root of A/E liability (C20) was 
more normally distributed. Four regressions were then run to check for a 
good model. The A/E liability (C4) was regressed against A/E costs (C5), 
the square root of A/E liability (C20) was regressed against A/E costs (CS), 
A/E liability (C4) was regressed against construction costs (C6), and the 
Square root of A/E liability (C20) was regressed against construction costs 
Meeyeeinese regressions resulted in R2 values of 9.7%, 6.6%, 93%, and 
11.6%, respectively. The A/E costs (Cd) were then transformed into the 
Square root of A/E costs (C21), the logarithm of A/E costs (C22) and the 
reciprocal of AVE costs (C23). The histograms of all four were plotted, 
indicating that the square root of A/E costs (C21) was most normal. fhe 
construction costs (C6) were then transformed into the square root of 
construction costs (C24) and the logarithm of the construction costs (C25). 
The histograms indicated the logarithm of the construction costs (C25) was 
anormal distribution. The following five regressions were then run: A/E 
lability (C4) against the square root of A/E costs (C2!), the square root of 
A/E liability (C20) against the square root of A/E costs (C21), A/E 
liability (C4) against the logarithm of construction costs (C25), the square 
root of A/E liability (C20) against the logarithm of construction costs 
(C25), and finally the multiple regression of the square root of A/E liability 
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(C20) against both the A/E costs (C5) and the construction costs (C6). The 
resulting RZ values were 3.1%, 5.4%, 2.6%, 3.6%, and 17.3%, respectively. 
None of the R* values was Significant, so the plots of A/E Hiability (C4) 
against A/E costs (C5) and A/E liability (C4) against construction costs 
(C6) were reviewed for outliers. Data element #9 is an outlier and has been 
circled on the plots in Appendix E. 

This data element was then deieted from the set of very complex data and 
the revised data were printed in Appendix —. The A/E liability (C4) was 
transformed into the square root of A/E liability (C30), and both histograms 
were plotted. The square root of A/E liability (C30) was more normally 
distributed. Four regressions were then run to check for a good model. The 
A/E liability (C4) was regressed against A/E costs (C5), the square root of 
A/E liability (C30) was regressed against A/E costs (C5), A/E liability (C4) 
was regressed against construction costs (C6), and the square roct of A/E 
liability (C30? was regressed against construction costs (€6). The 
respective RZ values were 14.7%, 6.8%, 8.4%, and9.9%. The A/E costs (CS) 
were then transformed into the square root of A/E costs (C31), the 
logarithm of A/E costs (C32) and the reciprocal of A/E costs (C33). The 
four histograms were plotted, indicating the square root of A/E costs (C31) 
was a little more normally distributed. The construction costs (C6) were 
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transformed into the square root of construction costs (C34) and both 
histograms were plotted, indicating the square root of construction costs 
(C34) was a little more normally distributed. The following five 
regressions were thenrun: A/E liability (C4) against the square root of A/E 
costs (C51), the scuare root of A/E liability (C30) against the scuare roct 
OF A/E costs (C51), the A/E liability (C4) against the square coor of 
construction costs (C34), the square root of A/E liability (C30) against the 
square root of construction costs (C54), and the multiple rearession of A/E 
Nabiiity (C4) against both tne square root of A/E costs (C31) ana the square 
MOOT OlNGORSTrUCtION COSTS (C34). 
R¢ values of 17.3%, 7.7%, 6.0%, 8.5% and 21.6%. The R@ value of 21.6% was 
disregarded because of the very low t-ratio values of -1.09 for the square 
root of A/E costs (C31) and -0.57 for the square root of construction costs 
(C34), None of the R¢ values was significant, so the plots of A/E liability 
(C4) against A/E costs (CS) and A/E liability (C4) against construction 
costs (CO) were reviewed for outliers. No more data points were deleted. 
The regression equations were not improving significantly with the deletion 
of more data, and the data base was becoming too small to be representative 


of the initial data. 
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The regression of A/E liability (C4) against the square root of A/E costs 
(C31), after deleting two data elements, produced the best Re value of 
17.3%. The regression equation is: A/E liability (C4) = 72507 - 19.8 x the 
Square root of A/E costs (C31). The t-ratio for the independent variable 
was -1.2]1. This means that 17.3% of the sample variation in A/E lability 
can be explained by the change in the square root of A/E costs but not 
Significantly because of the low t-ratio. Even though this was the best 
relationship for A/E liability, the regression equation does not provide 
enough of arelationship to be useful in predicting A/E liability. 

The three data sets (not complex, complex and very complex) reviewed 
did not produce a significant relationship for predicting the future amount 
of A/E liability. One last effort to form a predictive model was attempted. 
The three data sets were combined into one total data set, which was 


Studied next. 


E. REGRESSION ANALYSIS OF ALL PROJECTS 

The three categories of projects combine for a total of 91 projects. 
Appendix F contains the data, compiled in order of complexity from not 
complex to very complex, and all of the regression analyses. The A/E costs 
and the construction costs were each multiplied by one thousand so that all 
of the data were in the same units. The five columns in Appendix F contain 
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the same types of data as the five columns in Appendices C, D and —. After 
the histogram of A/E liability (C4) was plotted and then compared to the 
histogram of the square root of A/E liability (C10), the first four 
regressions were run. These were: A/E liability (C4) against A/E costs 
(C5), the square root of A/E liability (C10) against A/E costs (C5), A/E 
lability (C4) against construction costs (C6), and the square root of A/E 
lability (C10) against construction costs (C6). These regressions resulted 
in the respective Re values of 0.1%, 0.5%, 1.9%, and 1.6%. The histograms of 
A/E costs (CS) and construction costs (C6) were then plotted and compared 
to several transforms. The square root, logarithm, and reciprocal 
transforms were tried on the A/E costs (CS); the resulting histograms 
Indicated the logarithm of A/E costs (12) was the most normal. The square 
root, logarithm, and reciprocal transforms were tried on the construction 
costs (C6); the respective histograms indicated that the square root of 
construction costs (C14) was the most normal. A/E liability (C4) was then 
regressed against the logarithm of A/E costs (C12), producing an R¢ of 2.7%, 
The square root of A/E liability (C10) was regressed against the logarithm 
of A/E costs (C12), resulting in an R¢ of 49%. Two more regressions were 
then performed. The first was A/E liability (C4) against the square root of 
construction costs (C14), resulting in an R* of 3.5%. The second was the 


45 


square root of A/E liability (C10) against the square root of construction 
costs (C14), producing an RZ of 5.0% A multiple regression of the square 
root of A/E liability (C10) against the logarithm of A/E costs (C12) and the 
square root of construction costs (C14) generated an R¢ of 6.1% None of 
the R< values was Significant, so the plots of A/E liability (C4) against A/E 
costs (C5) and A/E liability (C4) against construction costs (C6) were 
reviewed for outliers. Data elements 738, 74, 81, 83, and 86 are outliers 
and nave Deen circled on the plots in Appendix F. 

The five data elements were then deleted from the set of data and the 
revised data were printed in Appendix -. The A/E liability (C4) was 
transformed into the square root of A/E liability (C20) and both histograms 
were plotted, indicating the square root of A/E liability (C20) was more 
normally distributed. Four regressions were then run fo check for a good 
model. The A/E liability (C4) was regressed against A/E costs (C5), the 
square root of A/E liability (C20) was regressed against A/E costs (C5), 
A/E liability (C4) was regressed against construction costs (C6), and the 
square root of A/E liability (C20) was regressed against construction costs 
(C6). The respective R2 values were 43%, 6.1%, 0.6%, and 0.4%. The A/E 
costs (C5) were then transformed into the logarithm of A/E costs (C21) and 
the histograms were plottea, indicating that the logarithm of A/E costs 
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(C21) was more normal. The construction costs (C6) were then transformed 
into the square root of construction costs (C22) and both histograms were 
plotted, indicating the square root of the construction costs (C22) was a 
normal distribution. The following five regressions were then run: A/E 
liability (C4) against tne Jogarithm of A/E costs (C21), the square root of 
A/E liability (C20) against the logarithm of A/E costs (C21), A/E lability 
(C4) against the square root of construction costs (C22), the square root of 
A/E Hiability (C20) against tne square root of construction costs (C22), and 
finally, the multiple regression of the square root of A/E lability (C20) 
against both tne A/E costs (CS) and the square root of construction costs 
(C22). The resulting R* values were 3.6%, 44%, 3.0%, 3.3%, and 6.6%, 
respectively. None of the Re values wa Significant, so the plots of A/E 
Nability (C4) against A/E costs (CS) and A/E liability (C4) against 
construction costs (Co) were reviewed for further outliers. Data elements 
#8, 9, 45, 71, 81, and 86 are outliers and have been circled on the plots in 
Appendix F. 

Those six data elements were then deleted from the set of data and the 
revised data were printed in Appendix F. The A/E liability (C4) was 
transformed into the square root of A/E liability (C30) and both histograms 
were plotted, indicating the square root of A/E liability (C30) was more 
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normally distributed. Four regressions were then run to check for a good 
model. The A/E liability (C4) was regressed against A/E costs (C5), the 
Square root of A/E liability (C30) was regressed against A/E costs (C5), 
A/E liability (C4) was regressed against construction costs (C6), and the 
square roct of 4/E liability (C30) was rearessed against construction costs 
(CS). Respective Re Values were 71.6%, 175%, 11.7%, and 10.20% ine Ase 
costs (CS) were then transformed into the logarithm of A/E costs (C31) and 
both histograms were clotted, indicating the logarithm of A/E costs (C31) 
was a little more normally distriputed. The construction costs (C&) were 
transformed into the square root of construction cosis (C32 7aiaiae 
Nnistograms were plotted, indicating that the square root of construction 
costs (C32) was normally distributed. The following five regressions were 
then run: A/E liability (C4) against the logarithm of A/E costs (C31), the 
Square root of A/E liability (C30) against the logarithm of A/E costs (C31), 
the A/E liability (C4) against the square root of construction costs (C32), 
the square root of A/E liability (C30) against the square root of 
construction costs (C32), and the multiple regression of A/E liability (C4) 
against both the A/E costs (CS) and the square root of construction costs 
(C32) These regressions resulted in respeewime R* values of & 3.4%, 7.0%, 
14.0%, 13.1% and 23.0%. The R¢ value of 23.0% was disregarded because of 
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the very low t-ratio value of 1.19 for the square root of construction costs 
(C32). None of the R* values was Significant, so the plots of A/E Hability 
(C4) against A/E costs (CS) and A/E liability (C4) against construction 
costs (C6) were reviewed again for outliers. No more data points were 
deleted. The regression equations were not improving with the deletion of 
more data, and the data base was becoming too small to be representative of 
the initial data. 

The regression of A/E liability (C4) against A/E costs (CS) after deleting 
eleven data elements produced the best R+ value of 21.6%. The regression 
equation is: A/E liability (C4) = 9502 + 0.0460 x A/E costs (C5). The 
t-ratio for the independent variable was 4.64. This means that 21.6% of the 
Sample variation in A/E liability can be explained by the change in A/E 
costs. Even though this was the best relationship for A/E liability, the 
regression equation does not provide enough of arelationship to be useful In 


predicting A/E liability. 


F, CONCLUSIONS 

A good mathematical equation to predict the amount of A/E liability on 
future projects was not found by regression analysis. One other 
relationship was then tried. A chart of the complexity of the projects was 
assembled and is shown in Appendix G. Six ranges of A/E liability (A/E) 
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were established, as follows: 0 (no liability), 0 < A/E < $10,000, $10,000 < 
A/E < $50,000, $50,000 < A/E < $100,000, $100,000 < A/E «< $500,000, and 
A/E » $500,000. The number of projects in each range from each complexity 
category was then listed. fhe percent of this number with respect to the 
total in the complexity category was calculated. Then the percent of this 
number in each range for all categories was calculated. Along the bottom of 
the chart the number of projects in each range was totaled. The percent of 
this number within the total 91 projects was calculated. The percent of 
projects with liability within each complexity category was calculated and 
shown on the bottom of the chart. Finally, for each category, the percent of 
the total was multiplied by the percent with liability. From these figures it 
can be seen that there is about an equal likelihood of an occurrence of A/E 
liability in the not complex and complex categories and a slightly higher 
chance of occurrence in the very complex projects. Since the number of very 
complex projects was limited there was little sigrificance to the higher 
chance of A/E Jiability. Also, from the chart it can be seen that nearly three 
quarters (73.7%) of the projects had an A/E liability amount of less than 


$50,000. 


V. CONCLUSIONS AND RECOMMENDATIONS 


A. CONCLUSIONS 

This study found no significant relationships between A/E costs and /or 
construction costs and the amount of A/E liability. _ Although the data 
Indicate that the very complex projects are more likely to have A/E liability 
than the complex or not complex projects, the relationship is not 
Significant. No usable mathematical equation for predicting the amount of 
A/E jiability was obtained through regression analysis. No usable 
information for predicting the incidence of A/E liability was found by this 


study. 


B. RECOMMENDATIONS 

The data studied did not lead to a mathematical prediction model. 
Therefore, recommendations are based on the literature researched. The 
construction manager should require the A/E that designed the project to be 
actively involved with the construction of the project. Tne A/E should be 
informed of any required changes to the design during the course of 
construction. In the case of a large change order or one in which A/E 
liability may be present, the A/E should be required to make a site visit. 


>| 


During and subsequent to the site visit the A/E should provide one or more 
possible solutions to the construction problem. 

The Government will be better off with greater A/E involvement. If 
there is any question concerning the responsibility for a construction 
oroplem, the Government snouid offer to pay the A/E site visit costs and any 
design/redesign costs associated with the construction oroplem. ine A/E3 
design/recesign will probably be the best for the situation, because the A/r 
IS most familiar with tne criteria for the original aesign of the project. cn 
the case wnere the construction proplem is a result of an error or omission 
In tne eriginal aesign, the site visit_and aesign/redesign costs May cienaes 
recovered from the A/E. Although the cesign/redesign costs may be 
difficult to recover, they are very small in comparison to the change order 
construction costs; and the potential to save on construction costs Is great. 

If the Government pursues A/E liability, the A/E community will 
attempt to provide a better design to minimize liability. WIth greater 
involvement of the A/E, better and more timely solutions fo construction 
problems should result in lower change order costs. This means the Govern- 
ment will be getting improved designs from the A/E community. When 
construction problems arise, the problem will be resolved in a less costly 


manner - a“win-win" situation for all parties, especially the Government. 
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Future studies of A/E lability could be very beneficial to the Govern- 
ment. The A/E liability programs at all of the Navy's Engineering Field 
Divisions and large Officer in Charge of Construction offices could be 
analyzed to determine the best features, which could then be implemented 
at all divisions and commands. A couple of the attributes of these features 
could be as follows: a low A/E liability for the agency, measured by actual 
dollar or percent; and timely settlement of A/E liability, measured in days 
from the board decision to pursue unti! the settlement of A/E liability. New 
projects originating from WESTDIV are supposed to have weekly site visits 
pyethe A/c. These projects could also be analyzed to determine if the 
change order costs and the A/E Hability are less than in the group studied 


for this thesis. 


APPENDIX A 


COMPILED RESEARCH DATA 
mal ~ 2 ai #4 #5 my) 77 #8 *o Sie 
Job Description Complexity A/E adj Const adj A/E adj Set adj 
Year Designed costs A/E costs Const Ha A/E amt set 
Year Constructed 000 cost 00 0S lia mt 
1. Sub Training Fac. NC 74 95 2 48 0 0 0 0 
76D 80C 
2. Bldg Envir Control NC 92 116 664 984 4000 3520 0 0 
76D 78C 
3. Ror Hydrostatic Relief C 918 1166 1487 1308 4977 4380 Ip 0 
76D 78C 
4. Barracks NC 197 290 4890 6210 0 0 0 0 
76 
9. Steam/ Condensate Mod. G Rai 403 2732 3470 0 0 0 0 
76 
6. Life Safety Upgrade C 81] 1050. 1876 1745 0804 59398 IP = 
76D 80C 
7. Gas Processing Plant vc 6702 8512 18567 S41) 12234 10154 6117 35077 
76D 79C 
8. Elk Hills Tank Farm C 7614 9670 10100 10100 21000 21000 0 0 
760 77C 
9. Causeway/Bridge Alts ¢ 827 1050 15236 13236 0 0 0 0 
760 77C 
10. Alter Hanger #3 c 14 14 2426 2426 25500 25500 0 0 
77 
11. A/C Maint Training Fac C 391 391 9454 9454 12191 12191 12191 12191 
77 
12. Hanger Improvements NC 166 166 5180 3180 0 0 0 0 
77 
13. Pipe & Copper Shop VC 89 059 6928 6928 0 0 0 0 
77 
14. Rigging Shop G 078 0/78 7690 7690 67519 67519 24547 24547 
77 
15. A/C Maint Hanger C 656 636 8835 8833 0 0 0 0 
77 
16. Comp Med/Dent Clinic VC 1547 1347 9912 9912 1616460 1616460 Ip o 


77 


24 


| 
Job Description 


Year Designed 
Year Constructed 


17. Base Engr Maint Fac 
a 

18. Op Training Bldg Add 
77 

19. Oj! Spill Prev Fac 
V@ 

20, Salt Wtr Sys Imprv 
770 BIC 

21. SIMA Fac First Incre 
Val 

22. Med/Den Clinic 
7@ 

25. Alter/Addn Br Hosp 
77D 78C 

24. Industrial Waste Fac 
La 

20. Pipe & Boiler Shop 
Vi 

26. Steam Plant 
we \C 

27. Barracks 
77 

28. SRB Fac 
78 

29. Industrial Sec Fence 
78 

50. Helo Escape Train Fac 
78 

31. Family Housing Repair 
78 

52. Pier Improvements 
78 


*2 
Complexity 


NC 


VC 


VC 


VC 


NC 


NC 


NC 


COMPILED RESEARCH DATA 


5 
A/E 
costs 

000 


15 


147 


374 
561 
766 

48 
723 
1349 

10156 

412 

1539 
30 
74 
414 


418 


#4 

adj 

A/E 
costs 


13 


147 


992 


574 


06 | 


756 


48 


723 


1549 


10156 


41\2 


1554 


26 


65 


564 


368 


ye) 


iS 
Const 
costs 
000) 


3924 


1945 


1774 


5206 


0198 


7947 


452 


4300 


20566 


9900 


6882 


8686 


564 


1087 


36124 


4062 


*6 
adj 


Const 
costs 


3924 


1945 


1774 


2855 


9198 


7947 


098 


4500 


20566 


4896 


6882 


7644 


520 


997 


31789 


3979 


#7 
AVE 
lia 


100000 


115596 


Zim? | 


132298 


13498 


4950 


57038 


100000 


4228 


39000 


84858 


ZS 
adj 

A/E 
lia 


100000 


1 19696 


1885 10 


152298 


15498 


4950 


24064 


100000 


5720 


50800 


746795 


*9 #10 
Set adj 
amt Sel 

amt 
0 0 
0 0) 


100000 100000 


7800 6864 


724335 63741 


COMPILED RESEARCH DATA 


a4 eZ a5 #4 =o mG od 7! 7 ae 

Job Description Complexity A/E adj Const adj A/E adj Set adj 

Year Designed costs A/E costs Const lia A/E amt set 
Year Constructed 000) casts 000 costs lia amt 

33. Weap Sys Sup Fac c 435 361 4729 3925 10275 8528 Ip - 
79 

34. Flight Simulator Bldg ¢ Slee 427 ose9 2107 13470 11180 1p - 
79 

SoeoeaRacks NC 292 234 4415 5669 195182 16200! IP - 
79 

36. Com/Nex Comolex NC 664 55 | 9309 7726 107961 89508 Ip - 
79 

37. Elk Hills Tank Farm 6 820 763 0008 4615 18809 15611 9000 4150 
80 

38. Base Maint Fac NC 204 190 eio2 2066 5046 4693 2023 1881 
80 

39. Orgn Maint Fac Impro C So/ SNS) 6224 3/66 95658 88962 Ip - 
80 

40. Operational Train Fac C 379 352 3977 3699 44103 41016 IP = 
80 

41. Barracks NC 029 488 086 | 0451 43899 40826 IP - 
80 

42. Operational Train Fac C 48 | 447 4382 4075 108732 101121 14500 13485 
80 

43. Child Care Center NC 190 177 1925 1386 23944 17240 23944 17240 
80D 83C 

44. Adv Lghtwt Torp Shop C 624 980 7200 6692 100000 93000 0 0 
80 

45. Repair Housing NC 2010 1869 8585 7984 13514 12568 IP - 
80 

46. Tac Veh Maint Shop NC 28 | 261 2684 2496 0 0 0 0 
80 

47. Aircraft Maint Hor C 2 104 7049 6556 0 0 0 0 
80 

48. Rpr Piers 10 & 11 S Zo 70 1084 1008 0 0 0 0 


80 
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COMPILED RESEARCH DATA 


= #2 #5 *4 gi *6 ae +o 29. #30 

Job Description Complexity A/E adj Const adj A/E adj set adj 

Year Designed costs A/E costs Const lia A/E amt set 
Year Constructed 000) _costs 000 costs lia amt 

49. Elect Distribution Proj — C 79 73 1050 977 =3130 = 2911 700 00 | 
60 

50. Rpr Gateway Roofs C 1162 1082 4199 “3905 49106 45669 IP - 
80 

Of. int Maint Fac g 536 312 1875 1669 31097 27676 \P - 
80D 81C 

52. Emergency Pwr Lift NC 4 4 144 120 = 3710 3079 0) 0 
80 

035. Data Processing Ctr NC 446 415 4950 4604 10033 9331 0 0 
80 

04. Mission Ctr! Complex VC 786 701 S908 0150 95248 88581 \P - 
80 

oo. Operational Train Fac c Zo ry! Vee 651 5000 4450 0 0 
81 

06. Applied Inst Bldg NC 473 421 3214 2860 22839 20327 11000 9790 
8! | 

07. Barracks NC 967 861 13780 12264 84247 74980 IP = 
Bt 

08. Barracks NC 469 417 4053 5607 1S | 1007 900 386445 
Bt 

09, Nex Complex NC 168 150 2144 1908 28858 25684 Ip - 
Bt 

60. Bowling Alley NC 29 26 170 aie 1794 1615 641 577 
61D 82C | 

61. Reroof Bldgs NC Fe) 4 1086 967 82195 73154 IP = 
ol 

62. Engr Lab e S20 494 9243 4666 700 623 Ip - 
61 

63. Rpr Hanger Doors c 86 ‘di 879 794 7278 =: © || 1000 900 
8] 

64. Dry Dock Mooring VC 1254 1116 15824 =6. 14083 93962400 55536 Ip = 
Bt 


Si, 


COMPILED RESEARCH DATA 
= aii? mS #4 a) eG wad 0 #9 “a0 
Job Description Complexity A/E adj Const adj A/E adj Set adj 


Year Designed costs A/E costs Const lia A/E amt set 
Year Constructed 000 costs 000 costs lia amt 

69. Barracks NC 329 289 9049 4935 6127 S455 Ip - 
BI 

66. Barracks NC 1282 1141° 8990 8001 138826 123555 Ip = 
St 

67. Applied instr Bldg NC 473 42 | 1952 WO? (aro S59 1750 ises 
8] 

68. Utility Plant € 1679 1494 2927 2654 162260 146034 Ip - 
8 | 

69. Ror Control/Telemet & 68 61 974 ott 3425 3048 3205 2652 
St 

70. Alt Sat Cont Fac VC 1107 985 1588 1143 67397 48526 Ip - 
S1D 83C 

71. Child Care Center NC 223 198 952 847 3098 2757 1648 1467 
81 

72. Ocean Systems Lab S 703 626 4722 4203 6166 5488 IP = 
81 

73. Child Care Center NC 94 84 695 619 2743 244) Ip = 
SI 

74, Dinning Fac Mod C 176 157 1788 1591 8284 7373 Ip = 
8} 

79. Barracks NC 459 409 ore | 4567 10015 8913 Ip = 
SI 

76. Dining Fac NC 120 107 960 854 22150 19714 Ip z 
BI 

77. Trident Motor test Fac VC 23/9 =a / 6844 6091 224403 199719 Ip - 
8] 

78. Cold Storage Whse c 104 93 800 712 230000 204700 0 0 
8] 

79. Roof Repairs NC 7 159 97 87 0 0 0 0 
82 

80. LCAC Complex VC 3814 3433 7499 5399 243534 817520 Ip - 
82D &3C 
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COMPILED RESEARCH DATA 


et 2 ae *4 <2 26 a7 | zo #0 21.0 

Job Description Complexity A/E adj Const adj A/E adj Sel adj 

Year Designed costs A/E costs Const lia A/E amt set 
Year Constructe (O00) ec 000 casts | amt 

81. Elect/Comm Shop : OT ais 2299 2066 1922 1370 Ip - 
82 

82. Guided Missile Lab c 859 is 6166 3949 851484 766336 Ip - 
82 

83. Barracks NC 924 472 1455 1295 71414 §3558 0 0 
82 | 

84, Parachute Ror Shop C 208 187 1949 1394 6623 096 | ip - 
82 

85. Storm Drain System NC 120 108 135 Io 3000 2160 IP - 
B2bReSC 

86. Ror Elect Service € Tis 68 7 4} 0 0 9) Q 
32D 83C | 

37. Racquetball Courts NC IA 192 179 161 8000 7200 IP - 
82 

88. Addn To Aeromedical C 89 80 779 701 1689 1920 Ip - 
82 

89. Maint Hanger C 729 SAS 0240 3773 13344 9608 IP - 
83 

90. Training bldg C 90 65 982 698 2136 1431 480 322 
83D 84C 

91. Radar Fac Mod e 78 a2 779 S227 2/017 18503 IP - 
84 
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APPENDIX B 


NEW CONSTRUCTION INDEX* 


yéar inGix 100/ Indix correction 
factor 
79 66 100/66 = qeee 
76 79 100/79 = 27 
Va 1QQ Oly, ELC) — 1,00 
78 114 100/114 = 88 
79 121 =—s«4JOO/Zt= 8B 
80 108 100/108 = 93 
8 | 112 [CO rz 89 
82 114 100/111 = .90 
83 138 100/138 = shee 
84 130 100/150 = 67 


* Data fromref. 10 and 11 
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APPENDIX © 


REGRESSION ON NOT COMPLEX PROJECTS 


COLUMN 


Cl NUMERICAL LISTING OF THE PROJECTS 


C2 THE AZE COSTS IN THOUSANDS 


C3 THE CONSTRUCTION COSTS IN THOUSANDS 


C4 AZE LIABILITY COSTS 
C5 THE AE COSTS 
C6 THE CONSTRUCTION COSTS 


MTB > let c5 = 1000x%c2 
MTB > let c6 = 1000xc3 
MTB > print cl-c6 


ROW Cl C2 C3 C4 
1 i 35 G8 0 
2 io 584 3520 
3 3 250 6210 
4 S 166 3180 0 
5 5 13 3524 100000 
6 6 G12 6282 100000 
7 7 26 320 30800 
8 8 364 31789 0 
9 2 234 3664 Teeceou 

10 10 oo 1 7726 89608 
1 ual 190 2066 G693 
12 12 483 5451 G0S26 


13 is” «§6 ia 1386 17250 


14 14 1869 7984 12568 
15 15 261 2496 
16 16 4 120 3079 
17 17 415 4604 9331 
18 18 421 2860 20327 
19 19 861 12264 74930 
20 20 417 3607 1007 
21 21 150 1908 25684 
22 22 26 153 1615 
23 23 G 967 73154 
24 24 289 4935 545 
25 25 1141 8001 123555 
26 26 421 1737 5159 
27 27 198 847 2757 
28 28 84 619 2441 
29 29 G09 4567 8913 
30 30 107 854 19714 
oe 31 159 87 0 
32 32 472 1295 63558 
23 33 108 123 2160 
34 34 192 161 7200 

MTB > pegr c4 1 c5 

The rearession equation is 

C4 = 21366 + 0.0257 C5 

Predictor Coef Stdev 

Constant 21366 9725 

c5 0.02589 0.02003 

S$ = 41998 R-sq = 6.9% 


Analysis of Variance 


108000 
192000 


t-ratio 
2.20 
1.28 


R-sq(adj) = 1.9% 


SOURCE | DF ss Ms 
Regression Ll 2900965120 2900965120 
Error 32 564416643008 1763&01344 
Total 33 59342606336 


onl 


C6 


48000 
584000 
6210000 
3180000 
3524000 
6832000 
320000 


31788992 


3664000 
7726000 
2066000 
5451000 
1326000 
7954000 
2456000 

120000 
4604000 
2860000 


12264000 


3607000 
1908000 
153000 
967000 
4935000 
8001000 
1737000 
847000 
619000 
4567000 
&54000 
87000 
1295000 
Le00 
161000 


Unusual Observations 


Obs. C5 C4 Fit Stdev.Fit Residual St.Resid 
9 234000 162001 27377 7435 134624 3.26R 
14 1869000 12568 69380 31733 -56812 -2.07RX 
25 1141000 123555 50678 17841 72877 1.92 X 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives 1t large influence. 


MTB > hist c4 
Histogram of C4 N = 34 


Midpoint Count 
0 ome SS SSSSSSCSSS Ss SSS SSS 


20000 5S MRK 
40000 2 Xx 
60000 l x 
80000 3 *xxX 
100000 OC xX 
120000 l x 
140000 0 

160000 l x 


MTB > let cl10 = sart(c4) 
MTB > hist cld 


Histogram of C10 N = 34 
Midpoint Count 
0 


6 XRRKRX 

50 10 RXR RX 
100 G MRXX 

150 G XX 

200 QC x 

250 3 xX 

300 3 XxX 

350 l x 

400 l xX 


MTB > regr cl10 1 cS 


The regression equation is 
C10 = 102 +0.000082 C5 


Predictor Coef Stdev t-ratio 

Constant 101. 86 26.52 3.84 

cS 0.00008192 0.00005463 1.50 

§$ = 1146.5 R-sq = 6.6% R-sq(adj) = 3.6% 

Analysis of Variance 

SOURCE — DF Ss MS 

Regression 1 29497 29497 

Error 32 419770 13118 

Total 33 449266 

Unusual Observations 

Obs. cs C10 Fit Stdev.Fit Residual St.Resid 
9 234000 402.5 121.0 20.3 281.5 2.50R 
14 1869000 112.1 255.0 86.5 -142.9 -1.90 X 
25 1141000 351.5 195.3 48.7 156.2 1.51 X 


R denotes an obs. with a large st. resid. , 
X denotes an obs. whose X value gives it large influence. 


Gm 


MTB > regr cl1O0 1 c6 


The regression equation is 
C10 = 123 +0.000001 C6 


Predictor Coef Stdev t-ratio 
Constant 122.97 24.63 G.99 
C6 0.00000143 0.00000358 0.40 
Ss = 118.2 R-sq = 0.5% R-sq(adj) = 0.0% 


Analysis of Variance 


SOURCE DE ss MS 

Regression 1 2242 2242 

Error 32 447925 13970 

Total 33 449265 

Unusual Observatisns 
Ss. C& C10 Fit Stdev.Fit Residual St. Resid 
8 31788992 0.0 168.5 101.8 -163.5 ~2.80RX 
9 3664000 402.5 128.2 20.3 274.3 2.36R 


R denotes an obs. with a large st. resid. . 
X denotes an obs. whose X value gives ix large influence. 


MTB > regr cG 1 c6é 


The reqression equation is 
C4 = 259735 + 0.00096 CS 


Predictor Coef Stdev t-ratio 
Csns<ant 25973 8898 2.92 
C6 0.000964 0.001292 0.75 
S = 42694 R-Ssq = 1.7% R-sq(adj) = 0.0% 


Analysis of Variance 


SOURCE DF ss MS 

Regression 1 1014430976 1014430976 

Error 32 58328178688 1822755584 

Total 33 59342606336 

Unusual Observations 
S. C6 CG Fit Stdev.Fit Residual St. Resid 
8 31788992 0 56624 36766 -56624 -2.61RX 
9 3664000 162001 29506 7329 132495 3.15R 
25 8001000 123555 33687 9030 89868 2.15R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > hist c5 
Histogram of C5 N = 34 


Midpoint Count 
0 7 RX 


200000 1G RHR HH HH HHH HK HHH 
400000 Me tTTrr ssf : 
600000 1 x 

800000 1 x 

1000000 0 

1200000 1 x 

1400000 0 

1600000 0 

1800000 1 x 


MTB > hist cé6 


Histogram of C6 N = 34 
Midpoint Count 
0 LS — REE HEE HEE EE EE EE EE EEK 
5000000 Ll RRM 
10000000 G KR 
15000000 0 
20000000 0 
25000000 0 
30000000 1 x 


MTB > let cli =1/c5 
MTB >» hast cll 


Histogram of Cil N = 34 
Midpoint Count 
0.00000 VOM TS SSTSSS SSS SSIS SSE SCS SSS TTT TS 
9.90002 2 xx 
0.00004 CQ Rx 
0.00006 0 
0.00008 1 x 
0.00010 0 
0.900012 0 
0.00014 0 
0.00016 0 
0.90013 0 
0.000290 0 
0.00022 0 
0.00024 0 
0.390025 Co «xX 
MTB > let cl2 = logt(c5) 
MTB > Nist cl2 
Histogram of Cl2 N = 34 
Midpoint Count 
3.6 2 xx 
G.0 1 x 
G.4 2 Xx 
4.8 Co x 
5.2 12 RM 
5.6 12 HHH KKK HX 
6.0 2 xx 
6.4 1 x 
MTB > let c13 = sqrt(c5) 
MTB > hist cl13 
Histogram of C13 N = 34 
Midpoint Count 
0 Co xX 
200 G RX 
G00 LD —- EEE EEE HEHE EK 
600 12 RHI RN 
800 1 x 
1000 2 Xx 
1200 0 
1400 1 x 
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MTB > let c1i5 = 17 c6 
MTB > hist ci5 


Histogram of C1i5 N = 34 


Midpoint Count 

0.000000 Verme SST ICTS SS SSCS SSS SST IF | 
0.000002 HHH 

0.00000G ¥ 

0.000006 x 

0.000008 S 3 

0.000010 
0.000012 
0.000014 
0.000016 
0.000018 
0.000020 


MTB > let cl16 = logt(c6) 
MTB > hist ci16 


Histogram of C16 N = 34 


x 


MH OOOFMONNE Ul 


x 


Midpoint Count 

4 %x 

Pett 

% KX 

HRA 

Xe HEX DEKE HEE HE 
ret T TT TTS 
x 

x 


e 
o 
@ 
e 


SND OD DULLED 
DOMOHODN) Co 
Me OOUAE ND 


MTB > let c17 = sart(c6) 
MTB > hist c17 


Histogram of C17 N = 34 
meni Count 


1 x 

500 6 KRRRRX 
1000 7 RRR 
1500 BG eK 
2000 7 RRR 
2500 3 RX 
3000 3 XX 
3500 1 x 

4000 0 

G500 0 

5000 0 
5500 1 x 


oS 


MTB > regr c10 1 cl3 


The regression equation is 
C10 = 81.0 + 0.0947 C13 


Predictor Coef Stdev t-ratio 

Constant 81.01 41.69 1.94 

C13 0.09466 0.07299 1.30 

s = 115.5 R-sq = 5.0% R-sq(adj) = 2.0% 

Analysis of Variance 

SOURCE DF SS MS 

Regression 1 22431 22431 

Error 32 426836 13339 

Total 33 449266 

Unusual Observations 

Obs. C13 C10 Fit Stdev.Fit Residual St.Resid 
5 114 316.2 91.8 34.6 224.4% 2.0GR 
9 484 402.5 126.8 19.9 275.7 2.42R 
14 1367 112.1 210.4% 66.1 -98.3 -1.04 X 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > regr cl0 1 cl7 


The regression equation is 
C10 = 87.4 + 0.0251 C17 


Predictor Coef Stdev t-ratio 

Constant 87.40 35.16 2.49 

C17 0.02514 0.01778 1.61 

s = 115.0 R-sq = 5.9% R-sq(adj) = 2.9% 

Analysis of Variance 

SOURCE DF SS MS 

Regression 1 26428 26428 

Error 32 422838 13214 

Total 33 449266 

Unusual Observations 

Obs. C17 C10 Fit Stdev.Fit Residual St.Resid 
8 5638 0.0 229.1 73.8 -229.1 -2.60RX 
9 1914 402.5 135.5 20.3 267.0 2.36R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > regr ¢c10 2 cls cl7 


The regression equation is 
C10 = 74.6 + 0.0504 C13 + 0.0175 C17 


Predictor Coef Stdev t-ratio 
Constant 74.57 G2.79 1.74 
C13 0.05038 0.09350 0.54 
C17 0.01751 0.02288 0.77 
s = 116.2 R-sq = 6.8% R-sq(adj) = 0.7% 


6h 


Analysis of Variance 


SOURCE DF ss MS 

Regression 2 30352 15176 

Error esl 418914 13513 

Total 33 449266 

SOURCE DF SEQ SS 

Cis 1 22431 

C17 1 7922 

Unusual Observations 

Obs. Ci3 C10 Fit Stdev.Fit Residual St. Resid 
8 6903 0.0 203.7 88.3 -203.7 -2.69RX 
9 484 4602.5 132.5 24.3 270.90 2.36R 
14 1367 at2 5 1 192.9 70.4 -80.3 -0.87 X 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large infiuence. 


MTB > plot c4 cS 


oD 
150000+ 
C4 : 
c (*) 495 


LO0000+ x % 


- x x 
50000+ 


—— x 
a ae 


Gi K 
at 
ps 
iS 


0+ HHS 3x2 ‘i x* 
Ae 


eee eee eee fee ee ee eee fern ee ee en fer ee reer efocssrecscscestesc=s--- C5 
0 350000 700000 1050000 1400000 1750000 
MTB > pilot c4 c6 
= % 
150000+ 
CG = 
= + 
100000+ % % 
x 
= * % 
= % 
50000+ 
~ % 
- x xX 
- x x 
- * x 3 % 
O+ ri 3 3G HE - AO #3 
we nn nn fee weer en peer eer een pee sss sees fecee----- --C6 


0 6000000 12000000 18000000 34000000 30000000 


MTB > delete 25 cl-c6 
MTB > delete 14 ci-c6 
MTB > delete 9 cl-c6 
MTB > delete 8 cl-c6 
MTB > print cl-c6 
ROW Ci C2 cs CG 
L L 95 G8 0 
2 2 116 584 3520 
3 3 250 6210 0 
G 4 186 3180 0 
5 5 13 3524 100000 
6 6 G12 6882 100000 
7 7 26 320 20800 
8 10 551 7726 89608 
9 ll 190 2066 4693 
10 12 438 5451 460825 
11 is 17% 1336 17240 
i2 15 261 2496 0 
1s 16 4 120 3079 
14 17 415 G60G 9331 
15 18 G21 2860 20327 
16 19 861 12264 74980 
17 20 417 3607 1007 
13 et 150 1908 25684 
19 22 26 heey 1615 
20 23 G 967 73154 
21 24 289 4935 5453 
22 25 621 1737 5159 
23 aw 198 847 2757 
2G 28 8G 619 24G1 
25 29 G09 4567 8913 
26 30 107 8546 197214 
27 31 159 87 0 
28 32 472 1295 63558 
29 33 108 133 2160 
30 34 192 161 7200 
MTB > let c20 = sqrt(cG) 
MTB > hist c20 
Histogram of C20 N = 30 
Midpoint Count 
0 3K EX 
50 10 RMR RRR 
100 3 xxx 
150 G XXX 
200 o XxX 
250 3 xx 
300 3 xX 
MTB > regr cG 1 c5 
The regression equation is 
CG = 10380 + 0.0537 C5 
Predictor Coef Stdev 
Constant 10380 9220 
C5 0.05371 0.02901 
Ss = 31306 R-sq = 10.9% 


Analysis of Variance 


SOURCE 


Regression 


Error 
Total 


DF 

il 
28 
29 


3359245824 
27441987584 
30801231872 


108000 
192000 


t-ratio 
1.13 
1.85 


C6 


48000 
584000 
6210000 
3180000 
3524000 
6882000 
320000 
7726000 
20660090 
5451000 
1386000 
2496000 
129000 
4604000 
2860000 


12264000 


3607000 
1908000 
153000 
967000 
4935000 
1737000 
847000 
619000 
4567000 
354000 
87000 
1295000 
133000 
161000 


R-sq(adj) = 7.7% 


MS 


3359245824 
980070912 
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Unusual Observations 


Obs. cS C4 Fit Stdev.Fit Residual 
5 13000 100000 11078 8927 88922 
6 412000 100000 32507 7411 67493 
16 861000 74980 56621 18640 18359 
20 4000 73154 10594 9129 62560 


R denotes an obs. with a large st. resid. 


St.Resid 


X denotes an obs. whose X value gives it large influence. 


MTB > regr c20 1 c5 


The regression equation is 
C20 = 72.5 +0.000178 C5 


Predictor Coef Stdev t-ratio 

Constant T2abs 28.75 2.52 

c5 0.00017769 0.00009046 1.96 

S$ = 97.62 R-sq = 12.1% R-sq(adj) = 9.0% 

Analysis of Variance 

SOURCE — DF ss MS 

Regression 1 36771 36771 

Error 28 266839 9530 

Total 29 303610 

Unusual Observations 

Obs. cS C20 Fit Stdev.Fit Residual 
5 13000 316.2 74.8 27.8 241.4 
16 851000 273.8 225.5 58.1 48.3 
20 4000 270.5 73.2 28.5 197.2 


R denotes an obs. with a large st. resid. 


St.Resid 
2.58R 
0.62 X 
2.11R 


X denotes an obs. whose X value gives it large influence. 


MTB > regr c4 1 c6 


The regression equation is 
C4 = 7927 + 0.00583 C6 


Predictor Coef Stdev t-ratio 

Constant 7927 7229 1.10 

C6 0.005827 0.001846 3.16 

S$ = 28486 R-sq = 26.2% R-sqladj) = 23.6% 

Analysis of Variance 

SOURCE DF Ss MS 

Regression 1 8081002496 8081002496 

Error 28 22720229376 811436544 

Total 29 30801231872 

Unusual Observations 

Obs. C6 C4 Fit Stdev.Fit Residual 
5 3524000 100000 28460 5409 71539 
16 12264000 74980 79387 18374 -4407 
20 967000 73154 13561 6125 59593 


R denotes an obs. with a large st. resid. 


St.Resid 
2.56R 
-0.20 X 
2.14R 


X denotes an obs. whose X value gives it large influence. 
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MTB > regr c20 1 c6 


The regression equation is 
C20 = 71.7 +0.000017 C6 


Predictor Coef 
Constant 71.69 
C6 0.00001660 
s = 92.19 R-sq 


Analysis of Variance 


SOURCE 
Regression 
Error 
Total 


Unusual Observations 
C C2 


Obs. 
5 3524000 
16 12264000 
20 967000 


DF 

1 
28 
29 


Stdev t-ratio 
23.40 3.06 
0.00000598 2.78 
= 21.6% R-sq(adj) = 18.8% 
ss MS 
65626 65626 
237984 8499 
303610 


Fit Stdev.Fit Residual 


316.2 130.2 17.5 186.0 
273.8 275.3 59.5 = BE 
270.5 87.7 19.8 182.7 


R denotes an obs. with a large st. resid. 


St.Resid 
2.06R 


X denotes an obs. whose X value gives it large influence. 


MTB > let cel = sqrt(c5) 
MTB > hist c5 


Histogram of C5 N = 30 


Midpoint Count 
0 GS RRR 

100000 S xe MX 
200000 7 (RR 
300000 3 Xxx 
490000 G6 XKXXIX 
500000 2 XxX 
600000 l x 
700000 0 
800000 0 
900000 1 x 


MTB > hist cel 


Histogram of C2l N = 30 
Midpoint Count 
100 3 xX 
200 2 Xx 
300 S RRR 
400 7-H IEE 
500 3 KX 
600 G6 RRA 
700 3 RX 
800 0 
900 1 x 


MTB > regr c20 1 cel 


The regression equation is 
C20 = 74.8 + 0.0930 C21 


Predictor Coef Stdev t-ratio 
Constant 74.80 43.90 £70 
C22 0.09300 0.08790 1.06 
s = 102.1 R-sq = 3.8% R-sqladj) = 0.4% 
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Analysis of Variance 


SOURCE DF Ss 
Regression 1 11670 
Error 28 291940 
Total 29 303610 


Unusual Observations 


Obs. Cel C20 
5 114 316.2 
16 928 273.8 161.1 
20 63 270.5 80.7 


R denotes an obs. with a larce st. 


MS 
11670 
10426 


45.8 Li2.7 
38.9 189.8 
resid. 


Fit Stdev.Fit Residual 
5.4 3 2 


St. Resid 
2.G1R 
1.24 X 
2.-O1R 


X denotes an obs. whose X vaiue gives it large influence. 


MTB > let c22 = 
MTB > Hist ch 


Histogram of C& N = 30 


sart{c6) 


Midpoint Count 
0 7 RRR RR 
1000000 7-H EE 
2000000 G KX 
3000000 C Xx 
4000000 oC Xx 
5000000 G XX 
6000000 1 x 
7000000 1 x 
8000000 1 x 
9000000 0 
10000000 0 
11000000 0 
12000000 1 x 


MTB > hist c22 


Histogram of C22 N = 30 
Midpoint Count 
0 1 x 
500 6 KRKKKX 
1000 7 WRK 
1500 5S KKK 
2000 6 KERR 
2500 3 KX 
3000 1 x 
3500 1 x 


MTB > regr c20 1 c22 


The regression equation is 


C20 = 39.8 + 0.0544 C22 

Predictor Coef Stdev t-ratio 
Constant 39.83 32.99 leer 

Cee 0.05439 0.02000 2.72 

S = 92.62 R-sq = 20.9% R-sqladj) = 18.1% 
Analysis of Variance 

SOURCE _ DF Ss MS 
Regression 1 63423 63423 

Error 28 240187 8578 

Total 29 303610 
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Unusual Observations 

Obs. C22 C20 Fit Stdev.Fit Residual St. Resid 
16 3502 273.8 25005 45.0 43.5 0.54 X 

X denotes an obs. whose X value gives it large influence. 


MTB > plot c& c6 
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MTB > delete 


20 c 


1=ce 


MTB > delete 16 cl-c6 
MTB > delete 5 cl-c6 
MTB > print cl-c6 


ROW Cl C2 C3 C4 C5 C6 
1 1 95 G8 0 95000 48000 
2 2 116 584 3520 116000 584000 
3 3 250 6210 0 250000 6210000 
G G 166 3180 0 166000 3180000 
5 6 412 6882 100000 412000 6882000 
6 7 26 320 30800 26000 320000 
7 | 10 551 7726 89608 551000 7726000 
8 11 190 2066 4693 190000 2066000 
9 12 488 5451 40826 488000 5451000 
10 13 177 1386 17240 177000 1386000 
11 15 261 2496 0 261000 2496000 
12 16 4 120 3079 4000 120000 
13 17 415 4604 9331 415000 4604000 
14 18 421 2860 20327 421000 2860000 
15 20 417 3607 1007 417000 3607000 
16 21 150 1908 25684 150000 1908000 
17 22 26 153 1615 26000 153000 
18 24 289 4935 5453 289000 4935000 
19 26 421 1737 5159 421000 1737000 
20 27 198 847 2757 198000 847000 
21 28 84 619 2441 84000 619000 
22 29 409 4567 8913 409000 4567000 
23 30 107 854 19714 107000 854000 
24 31 159 87 0 159000 87000 
25 32 472 1295 63558 472000 1295000 
26 33 108 133 2160 103000 133000 
27 34 192 161 7200 192000 161000 
MTB > let c30 = saqrt(cG) 
MTB > hist c30 
Histogram of C30 N = 27 
Midpoint Count 
0 we 23 3 
50 10 «—- EE EEE EH EX 
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MTB > regr c4 1 c5 
The regression equation is 
C4 = - 4741 + 0.0898 CS 
Predictor Coef Stdev t-ratio 
Constant -4741 8193 -0.58 
cs 0.08981 0.02814 3.19 
S$ = 23098 R-sq = 29.0% R-sq(adj) 26.1% 
Analysis of Variance 
SOURCE DF ss MS 
Regression 1 5435138048 5435138048 
Error 25 13337370624 53349478G 
Total 26 18772508672 


(es 


Unusual Observations 


Obs. c5 CG Fit Stdev.Fit Residual St. Resid 
5 412000 100000 32260 6477 67740 3.06R 
7 551000 89608 G447GG 9700 44864 2.14R 


R denotes an obs. with a large st. resid. 
MTB > regr c30 1 cS 


The regression equation is 
C30 = 27.8 +0.000287 C5 


Predictor Coef Stdev t-ratio 
Constant 27.79 27.52 1.01 
cs 0.00028688 0.000909G453 3.03 
S = 77.60 R-sq = 26.9% R-sqladj) = 24.0% 


Analysis of Variance 


SOURCE DF Ss MS 

Regression a 55458 55453 

Error 25 150526 6021 

Total 26 205985 

Unusual Observations 

Obs. C5 C30 Fit Stdev.Fit Residual St.Resid 
5 412000 316.2 146.0 21.8 170.2 2.29R 


R denotes an obs. with a large st. resid. 


MTB > let c3l1 = saqrt(c5) 
MTB > Hist c3l 


Histogram of C31 N = 27 
Midpoint Count 
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MTB > regr c30 1 c3l 


The regression equation is 


C30 = - 0.6 + 0.214 C31 

Predictor Coef Stdev t-ratio 
Constant -0.63 43.71 -0.01 

Csi 0.21366 0.08837 2.42 

Ss = 81.72 R-sq = 19.0% R-sql(adj) = 15.7% 
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Analysis of Variance 


SOURCE DF ss MS 

Regression 1 39037 39037 

Error 25 166948 6678 

Total 26 205985 

Unusual Observations 

Obs. C31 C30 Fit Stdev.Fit Residual 
5 642 316.2 136.5 22.4% 179.7 
le 63 55.5 12.9 38.5 42.6 


R denotes an obs. with a large st. resid. 


St. Resid 
2.29R 
0.59 X 


X denotes an obs. whose X value gives it large influence. 


MTB > regr c4 1 c6 


The regression equation is 
C4 = 2402 + 0.00617 C6 


Predictor Coef Stdev t-ratio 

Constant 2402 6474 0.37 

C6 0.006173 0.001955 3.16 

S$ = 23169 R-sq = 28.5% R-sql(adj) = 25.7% 

Analysis of Variance 

SOURCE ~ DF Ss Ms 

Regression 1 5352509440 5352509440 

Error 25 13419999232 536799744 

Total 26 18772508672 

Unusual Observations 

Obs. C6 C Fit Stdev.Fit Residual 
5 6882000 100000 44884 9829 55116 
7 7726000 89608 50094 11324 39514 
25 1295000 63558 10396 4956 53162 


R denotes an obs. with a large st. resid. 


St. Resid 
2.63R 
1.95 X 
2.35R 


X denotes an obs. whose X value gives it large influence. 


MTB > hist c6 
Histogram of Cé N = 27 
— Count 
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MTB > let c32 = saort(cé) 
MTB > Hist c32 


Histogram of C32 N = 27 
Midpoint Count 
400 7 
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MTB > let c33 = logt(c6) 
MTB > Hast c33 


Histogram of C33 N = 27 
Midpoint Count 
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MTB > regr cG 1 c33 


The regression equation is 


CG = - 81832 + 16367 C33 

Predictor Coef Stdev t-ratio 

Constant -81832 45522 -1.80 

C33 16367 7479 2.19 

s = 25103 R-sq = 16.1% R-sq(adj) = 12.7% 

Analysis of Variance 

SOURCE DF ss MS 

Regression 1 3017996800 3017996800 

Error 25 15754510336 630180352 

Total 26 18772504576 

Unusual Observations 

Obs. C33 CG Fit Stdev.Fit Residual St.Resid 
5 6.84 100000 30080 7606 69920 2.92R 
7 6.89 89608 30903 7899 58705 2.46R 


R denotes an obs. with a large st. resid. 
MTB > regor c30 1 c33 


The regression equation is 


C30 = = 233 + 54.7 C33 

Predictor Coef Stdev t-ratio 

Constant -232.9 150.6 -1.55 

C33 54.67 24.73 2221 

Ss = 83.02 R-sq = 16.3% R-sql(adj) = 13.0% 

Analysis of Variance 

SOURCE DF ss MS 

Regression 1 33668 33668 

Error 25 172316 6893 

Total 26 205985 

Unusual Observations 

Obs. C33 C30 Fit Stdev.Fit Residual St. Resid 
5 6.84 Sone 140.9 25.2 175.3 2.2eR 


R denotes an obs. with a large st. resid. 
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MTB > regr cG& 2c5 c6 


The regression equation is 


CG = - 3932 + 0.0526 C5 + 0.00345 C6 

Predictor Coef Stdev t-ratio 

Constant -3932 8154 -0.48 

c5 0.05258 0.04193 1.25 

C6 0.003455 0.002904 1.19 

s = 22908 R-sq = 32.9% R-sq(adj) = 27.3% 

Analysis of Variance 

SOURCE DF Ss MS 

Regression 2 6177984512 3088992256 

Error 24 12594520064 52467715864 

Total 26 18772504576 

SOURCE DF SEQ SS 

C5 1 5435138048 

cé6 1 742846976 

Unusual Observations 

Obs. C5 CG Fit Stdev.Fit Residual St.Resid 
5 412000 100000 41508 10084 58492 2.8GR 
25 472000 63558 25361 12899 38197 2.02R 


R denotes an obs. with a larse st. resid. 
MTB > regr cG 2 c3l1 c32 


The regression equation is 


C4 = = 11958 + 35.3 C31 + 9.63 C32 

Predictor Coef Stdev t-ratio 
Constant -11958 13191 -0.91 

C31 35.33 41.54 0.85 

C32 9.628 9.451 1.02 

s = 26319 R-sq = 24.4% Re-sq(adj) = 18.1% 


Analysis of Variance 


SOURCE DF Ss MS 

Regression 2 4578418688 2289209344 

Error 24 14194089984 591420416 

Total 26 18772508672 

SOURCE F SEQ SS 

C31 1 3964684800 

C32 1 613734144 

Unusual Observations 

Obs. C31 CG Fit Stdev.Fit Residual St. Resid 
5 642 100000 35979 9205 64021 2.8GR 
7 742 89608 41031 9878 48577 2.L19R 


R denotes an obs. with a large st. resid. 
MTB > regr cG 1 c32 


The regression equation is 


C4 = = 3977 + 15.9 C32 

Predictor Coef Stdev t-ratio 
Constant -3977 9220 -0.43 

C32 15.850 5.950 2.66 

S$ = 24184 R-sq = 22.1% R-sq(adj) = 19.0% 


Hak 


Analysis of Variance 


SOURCE DF SS MS 

Regression iL 4150585600 4150585600 

Error 25 14621921280 584876800 

Total 26 18772504576 

Unusual Observations 

Obs. C32 CG Fit Stdev.Fit Residual 
5 2623 100000 37603 8954 62397 
7 2780 89608 40079 9760 49529 
25 1138 63558 14060 4804 49498 


R denotes an obs. with a large st. resid. 
MTB > regr ¢30 1 c3s2 


The regression equation is 
C30 = 35.4 + 0.0468 C32 


Predictor Coef Stdev t-ratio 
Constant 35.36 31.42 1.433 
C32 0.04680 0.02028 Osu 
s = 82.41 R-sq = 17.6% R-sq(adj) = 14.3% 


Analysis of Variance 


SOURCE DF Ss MS 

Regression 1 36183 36183 

Error 25 169802 6792 

Total 26 205985 

Unusual Observations 

Cbs. C32 C30 Fit Stdev.Fit Residual 
5 2623 316.2 Tse. 7 30.5 158.1 
25 1138 Zoeel 88.6 16.4 163.5 


R denotes an obs. with a large st. resid. 
MTB > plot cG c5 
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MTB > plot c4 c6 
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MTB > delete 7 cl-c6 
MTB > delete 5 cl-c6 
MTB > print cl-c6 
ROW Cl C2 C3 CG cS C6 
1 1 95 G8 0 95000 48000 
2 2 116 584 3520 116000 584000 
3 3 250 6210 0 250000 6210000 
G G 166 3180 0 166000 3180000 
5 7 26 320 30800 26000 320000 
6 al 190 2066 4693 150000 2066000 
7 12 G88 5451 40826 488000 5451000 
8 13 177 1386 17240 177000 1386000 
9 15 261 2496 0 261000 2496000 
10 16 G 120 3079 4000 120000 
11 17 415 4604 9331 415000 4604000 
12 18 421 2860 20327 421000 2860000 
13 20 417 3607 1007 417000 3607000 
14 21 150 1908 25684 150000 1908000 
15 22 26 153 1615 26000 153000 
16 24 289 4935 5453 289000 4935000 
17 26 Gel 1737 5159 421000 1737000 
18 27 198 847 2757 198000 847000 
19 28 8G 619 2441 84000 619000 
20 29 409 4567 8913 409000 4567000 
21 30 107 854 19714 107000 854000 
22 31 159 87 0 159000 87000 
23 32 472 1295 63558 472000 1295000 
24 33 108 133 2160 108000 133000 
25 34 192 161 7200 192000 161000 
MTB > regr c4 1 c5 
The regression equation is 
CG = 2067 + 0.0397 C5 
Predictor Coef Stdev t-ratio 
Constant 2067 5319 0.39 
cs 0.03968 0.01974 2.01 
Ss = 14542 R-sq = 14.9% R-sq(adj) = 11.2% 


Analysis of Variance 


SOURCE DF Ss Ms 
Regression 1 854694400 854694400 
Error 23 4863979520 211477360 
Total 24 5718671360 


Teo 


Unusual Observations 


Obs. c5 C4 Fit Stdev.Fit Residual 
5 26000 30800 3098 4897 27702 
23 472000 63558 20793 5666 42765 
R denotes an obs. with a large st. resid. 
MTB > let c40 = sqrt(c4) 
MTB > hist c40 
Histogram of C40 N = 25 
Midpoint Count 
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MTB > regr cG0 1 c5 
The regression equation is 
C40 = 45.0 +0.000160 C5 
Predictor Coef Stdev t-ratio 
Constant 44.97 e2oume 1.90 
c5 0.00016045 0.00008803 1.82 
S = 64.86 R-sq = 12.6% R-sq(adj) = 8.8% 
Analysis of Variance 
SOURCE DF ss MS 
Regression 1 13978 13978 
Error 23 96766 4207 
Total 24 110744 


Unusual Observations 


Obs. c5 C40 Fit Stdev.Fit Residual 
5 26000 175.5 49.1 21.8 126.4 
23 472000 252.1 120.7 25.3 131.4 

R denotes an obs. with a large st. resid. 

MTB > regr c4 1 c6 

The regression equation is 

C4 = 9881 + 0.00057 C6 

Predictor Coef Stdev t=-ratio 

Constant 9881 4607 2.15 

C6 0.000566 0.001675 0.34 

s = 15729 R-sq = 0.5% R-sq(adj) = 0.0% 

Analysis of Variance 

SOURCE DF ss MS 

Regression 1 28279680 28279680 

Error 23 5690392576 247408368 

Total 24 5718671360 

Unusual Observations 

Obs. C Fit Stdev.Fit Residual 
3 6210000 0 13398 7707 -~13398 
23 1295000 63558 10615 3366 52943 

R denotes an obs. with a large st. resid. 
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X denotes an obs. whose X value gives it large influence. 
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MTB > regr cG40 1 c6 


The regression equation is 
C40 = 76.9 +0.000002 C6 


Predictor Coef Stdev t-ratio 

Constant 76.95 20.29 3.79 

C6 0.00000211 0.00000738 0.29 

s = 69.27 R-sq = 0.4% R-sq(adj) = 0.0% 

Analysis of Variance 

SOURCE DF Ss MS 

Regression i 391 391 

Error 23 110353 4798 

Total 24 110744 

Unusual Observations 

Obs. C6 C40 Fit Stdev.Fit Residual St.Resid 
3 6210000 0.0 90.0 33.9 -90.0 -1.49 X 
23 1295000 252.1 79.7 14.8 172.4% 2.55R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > let cGi = sqrt(c5) 
MTB > Hist cGi 


Histogram of C4l N = 25 
Midpoint Count 
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MTB > let cG2 = sqrt(cé6) 
MTB > hist cG2 


Histogram of C42 N = 25 
Midpoint Count 
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MTB > regr c4 1 c4l 


The regression equation is 


CG = = 1091 + 27.3 C4l 

Predictor Coef Stdev t-ratio 
Constant -1091 8306 -0.13 
C41 27.34% 17.48 1.56 

Ss = 14992 R-sq = 9.6% R-sqladj) = 5.7% 
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Analysis of Variance 


SOURCE DF Ss MS 

Regression uw 549422336 549422336 

Error 23 5169250304 224750000 

Total 24 5718671360 

Unusual Observations 

Obs. C41 C4 Fit Stdev.Fit Residual St. Resid 
23 687 63558 17691 5215 45867 3.26R 


R denotes an obs. with a large st. resid. 
MTB > regr cG0 1 c4l 


The regression equation 15s 
C40 = 33.6 + 0.107 C41 


Predictor Coef Stdev t-ratio 
Constant 33.56 36.94 0.91 
C41 0.10748 0.07777 1.38 
S = 66.68 R-sq = 7.7% R-sqladj) = 3.7% 


Analysis of Variance 


SOURCE DF Ss MS 

Regression 1 8493 8493 

Error 23 102251 4446 

Total 24 110744 

Unusual Observations - 

Obs. C41 CGO Fit Stdev.Fit Residual St.Resid 
5 161 175.5 50.9 25.56 124.6 2.02R 
23 687 e5ecL 107.4 23.2 144.7 2.31R 


R denotes an obs. with a large st. resid. 
MTB > regr cG 1 c42 


The regression equation is 
CG = 7660 + 2.73 C42 


Predictor Coef Stdev t-ratio 

Constant 7660 6272 1.22 

C42 2.734 4.425 0.62 

S$ = 15639 R-sq = 1.6% R-sqladj) = 0.0% 

Analysis of Variance 

SOURCE DF Ss MS 

Regression 1 93355455 93355456 

Error 23 5625315328 2464578912 

Total 24 5718667254 

Unusual Observations 

Obs. C42 CG Fit Stdev.Fit Residual St.Resid 
23 1138 63558 10771 3153 52787 3.45R 


R denotes an obs. with a large st. resid. 
MTB > regr cGO 1 c&2 


The regression equation is 
C40 = 65.8 + 0.0125 C42 


Predictor Coef Stdev t-ratio 
Constant 65.79 27.58 2.39 
C42 0.01252 0.01946 0.64 
S$ = 68.77 R-sq = 1.8% R-sq(adj) = 0.0% 
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Analysis of Variance 


SOURCE DF ss MS 

Regression 1 1958 1958 

Error 23 1038786 4730 

Total 24 110744 

Unusual Observations 

Obs. C42 C40 Fit Stdev.Fit Residual St.Resid 
23 1138 252.1 80.0 13.9 172.1 2.55R 


R denotes an obs. with a large st. resid. 
MTB > plot c4 c5 
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MTB > delete 23 cl-c6 
MTB > print ci-c6 


ROW Cl C2 C3 CG C5 C6 
1 1 85 G8 0 95000 48000 
2 116 584 3520 116000 584000 
3 3 250 6210 0 250000 6210000 
4 ; 166 3180 0 166000 3180000 
5 7 26 320 30800 26000 320000 
6 11 190 2066 G693 190000 2066000 
7 le 488 5451 40826 488000 5451000 
8 13 177 1386 17240 177000 1386090 
9 15 261 2496 0 261000 2496000 

10 16 4 120 3979 G000 120000 


11 17 415 4604 9331 415000 4604000 
i2 18 G2l 2360 20327 421000 2860000 
13 20 417 3607 1007 417000 3607000 
14% ei 150 1908 25684 150000 1908000 
15 22 26 153 1615 26000 153000 
16 24 289 4935 5453 289000 4935000 
17 26 G21 ba 5 7 ee) 421000 1737000 
18 aT | 198 847 ofat 198000 847000 
Lo 28 84 619 2441 84000 619000 
20 29 G09 G567 8913 409000 4567000 
eli 30 107 854 19714 107000 854000 
22 31 V9 87 0 159000 87000 
23 33 108 133 2160 108000 133000 
24 34 2 161 7200 192000 161000 


MTB > regr c4& 1 c5 


The regression equatiscn is 
CG = 5212 + 0.0168 C5 


Predictor Coef Stdev t-ratio 


Constant 5212 4127 1.26 

c5 0.01580 0.01602 1.05 

S$ = 11094 R-sq = 4.8% R-sq(adj) = 0.4% 

Analysis of Variance 

SOURCE DF ss MS 

Regression 1 135394944 135394944 

Error 22 2707927296 123087600 

Total 23 2843322112 

Unusual Observations 

Cbs. C5 CG Fit Stdev.Fit Residual St.Resid 
5 26000 30800 5649 3785 25151 2.-41R 
7 488000 40826 13410 4919 27416 2.76R 


R denotes an obs. with a large st. resid. 
MTB > regr c4 1 c6 


The regression equation is 
C4 = 6759 + 0.00102 C6 


Predictor Coef Stdev t-ratio 
Constant 6759 3339 2.92 

C6 0.001016 0.001195 0.85 

S$ = 11186 R-sq = 3.2% R-sq(adj) = 0.0% 
Analysis of Variance 

SOURCE DF ss MS 
Regression 1 90405440 90405440 

Error 22 2752916736 125132576 

Total 23 2843322112 


on 


Unusual Observations . 
Obs. C& CS Fit Stdev.Fit Residual St.Resid 


5 320000 30800 7084 3071 237156 2.20R 
7 5451000 40826 12295 4673 28531 2.81R 


R denotes an obs. with a large st. resid. 


MTB > Let c50 = sqrt(c4) 
MTB > hist c50 


Histogram of C50 N = 24 
oo Count 
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MTB > regr c50 1 cS 


The regression equation is 
C50 = 54.6 +0.000090 C5 


Predictor Coef Stdev t-ratio 
Constant 54.64 21.92 2.49 
cS 0.00009015 0.00008509 1.06 
Ss = 58.93 R-sq = 4.9% R-sqladj) = 0.5% 


Analysis of Variance 


SOURCE DF Ss MS 
Regression 1 3899 3899 
Error 22 76409 3473 
Total 23 80308 


Unusual Observations 

Obs. Cc C50 Fit Stdev.Fit Residual St. Resid 
5 26000 75-5 57.0 20.1 118.5 2.14R 
R denotes an obs. with a large st. resid. 

MTB > regr c50 1 c6 


The regression equation is 
C50 = 66.8 +0.00000G C6 


Predictor Coef Stdev t-ratio 

Constant 66.78 17.91 3.73 

C6 0.00000357 0.00000641 0.56 

Ss = 60.00 R-sq = 1.4% R-sqladj) = 0.0% 

Analysis of Variance 

SOURCE DF Ss MS 

Regression 1 1116 1116 

Error 2e 79192 3600 

Total 23 80308 

Unusual Observations 

Obs. C6 c50 Fit Stdev.Fit Residual St. Resid 
7 5451000 202.1 86.2 eo 115.8 2.12R 


R denotes an obs. with a large st. resid. 
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MTB > let c51 = sqrt(c5S) 
MTB > hist c5l1 


Histogram of C51 N = 24 
Midpoint Count 
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MTB > regr c50 1 c5l 


The regression equation is 
CSO = 51.2 + 0.0528 C51 


Predictor Coef Stdev t-ratio 
Constant 51.19 33.77 1.52 
C51 0.05283 0.07278 0.73 
Ss = 59.71 R-sq = 2.3% R-sqladj) = 0.0% 


Analysis of Variance 


SOURCE DFE Ss MS 

Regression 1 1879 1879 

Error 22 78430 3565 

Total 23 80308 

Unusual Observations es 

Obs. C5i c50 Fit Stdev.Fit Residual 
5 161 175.5 59.7 23.2 115.8 
7 699 202.1 S221 22.9 114.0 


R denotes an obs. with a large st. resid. 
MTB > regr c4 1 c5l 


The regression equation 1s 
CG = 4495 + 10.0 C51 


Predictor Coef Stdev t-rati1o 

Constant GG95 6354 0.71 

C51 10.02 13.69 0.73 

S$ = 11233 R-sq = 2.4% R-sqladj) = 0.0% 

Analysis of Variance 

SOURCE DF ss MS 

Regression 1 67520800 67520800 

Error 2e 2775801600 126172800 

Total 23 2843322368 

Unusual Observations 

Obs. c5l1 C4 Fit Stdev.Fit Residual 
5 161 30800 6110 4368 24690 
7 699 40826 11492 4301 29334 


R denotes an obs. with a large st. resid. 
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St.Resid 
2.11R 
2.07R 


St. Resid 
2.39R 
2.83R 


MTB > let c52 = sqrt(c6) 
MTB > hist c52 


Histogram of C52 N = 24 
pacpeant Count 
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MTB > let c53 = logt(c6) 
MTB > hist c53 


Histogram of C53 N = 24 


Midpoint Count 
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MTB > regr c4 1 c52 


The _regression equation is 
CG = 4969 + 3.13 C52 


Predictor Coef Stdev t-ratio 

Constant S969 G49G Lone 

CS2 3.133 3.148 1.00 

Ss = lllel R-sq = 4.3% R-sq(adj) = 0.0% 

Analysis of Variance 

SOURCE DF ss MS 

Regression 1 122523760 122523760 

Error 22 2720798464 123672656 

Total 23 2843322112 

Unusual Observations 

Obs. C52 C4 Fit Stdev.Fit Residual St.Resid 
5 566 30800 6741 3091 24059 2.c5R 
7 2335 40826 12284 4146 28542 c./77R 


R denotes an obs. with a large st. resid. 
MTB > regr c50 1 c52 


The regression equation is 
C50 = 57.0 + 0.0138 C52 


Predictor Coef Stdev t-ratio 
Constant 57.02 24.05 2.37 
c52 0.01382 0.01684 0.82 
s = 59.51 R-sq = 3.0% R-sq(adj) = 0.0% 
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Analysis of Variance 


SOURCE DF ss MS 
Regression 1 2384 2384 
Error 22 77924 3542 
Total 23 80308 
Unusual Observations 
Obs. C52 C50 Fit Stdev.Fit Residual St.Resid 
7 2335 202.1 89.3 22.2 112.8 2.04R 
R denotes an obs. with a large st. resid. 
MTB > regr c50 2 c51 c52 
The regression equation is 
C50 = 53.5 + 9.017 C51 + 0.0108 C52 
Predictor Coef Stdev t-ratio 
Constant 53.47 34.92 1.53 
C51 0.0168 0.1173 0.14 
C52 0.01079 0.02725 0.40 
s = 60.89 R-sq = 3.1% R-sq(adj) = 0.0% 
Analysis of Variance 
SOURCE DF $s MS 
Regression 2 2461 12390 
Error 2i 77848 37907 
Total 23 80308 
SOURCE DF SEQ SS 
C51 1 1379 
C52 1 532 
Unusual Observations 
Obs. c51 C50 Fit Stdev.Fit Residual St.Resid 
5 161 1755 62.3 : ‘ 2.03R 
R denotes an obs. with a large st. resid. 
MTB > plot c4 c5 
- % 
36000 
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24000+ 
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= % % 3 
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fa wen en nna fon nn fe == = eee ee ee C5 
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MTB > plot cG c6 
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APPENDIX D 


REGRESSION ON COMPLEX PROJECTS 


COLUMN 


Cl NUMERICAL LISTING OF THE PROJECTS 


Ce THE 
C3 THE 
CG A/E 
C5 THE 
CS THE 


A/E COSTS IN THOUSANDS 


CONSTRUCTION COSTS IN 


LIABILITY COSTS 
A/E COSTS 
CONSTRUCTION COSTS 


MTB > let c5 = 1000x*c2 
MTB > let c6 = 1000%c3 
MTB > print cl-c6 

ROW C C2 C3 
1 l 1166 1308 
2 2 G03 3470 
5 3 1030 1745 
G & 9670 10100 
5 5 1050 13236 
6 6 14 2426 
7 7 391 54G45G 
8 8 578 7690 
9 9 635 8833 
10 10 147 1945 
ll ll 352 1774 
l2 le 374 2335 
13 13 561 5158 
14 14 G8 398 
L5 15 723 4300 
16 16 1354 7644 
17 17 65 957 
18 18 368 3575 
19 19 361 3925 
20 20 Ge7 2107 
21 21 763 G613 
22 22 313 5788 
23 23 352 3699 
24 2G GG7 4075 
25 25 580 6692 
26 26 104 6556 
2i o7 70 1008 
28 28 73 977 
29 29 1082 3905 
30 30 312 1669 
31 31 lil 651 
32 32 G9G G666 
33 eo 77 791 
34 34 1494 2634 
35 35 61 511 
36 36 626 4203 
Si 37 157 1591 
38 38 93 712 
39 39 213 2066 
40 GO 773 5549 
Gl Gl 187 1394 
G2 Ge 68 Gl 
G3 G3 80 701 
GG GG 525 S745 
G45 45 65 658 
G6 G6 52 522 


THOUSANDS 
C4 C5 
4330 1166000 
403000 
5389 1030000 
21000 9670000 
0 1050000 
25500 14000 
ens. 391000 
G7 Sa 578000 
636000 
| 147000 
115896 352000 
188610 374000 
561000 
48000 
13498 723000 
3720 1354000 
0 65000 
74675 368000 
8528 361000 
11180 427000 
1Soat 763000 
88962 313000 
41016 352000 
101121 447000 
93000 580000 
0 104000 
0 70000 
29i2 73000 
G5669 1082000 
27676 312000 
4450 111000 
623 494000 
6551 77000 
146034 1494000 
3048 61000 
5488 626000 
7373 157000 
204700 93000 
1370 213000 
766336 773000 
5961 187000 
0 68000 
1520 80000 
9608 525000 
1431 65000 
18503 52000 


oi) 


C6 


1308000 
3470000 
1745000 


10100000 
13236000 


2426000 
5454000 
7690000 
8833000 
1945000 
1774000 
2835000 
51580900 
398000 
4300000 
7644000 
957000 
3575000 
3925000 
2107000 
4613000 
5788000 
3699000 
4075000 
6692000 
6556000 
1008000 
977000 
3905000 
1669000 
651000 
4666000 
791000 
2634000 
511000 
4203000 
1591000 
712000 
2066000 
5549000 
1394000 
41000 
701000 
3773000 
658000 
522000 


MTB > regr cG4 1cS5 


The regression equation is 


C4 = 45900 + 0.0014 C5 

Predictor Coef Stdev t-ratio 

Constant 45900 19537 2.255 

cS 0.00137 0.01275 0.11 

Ss = 120866 R-sq = 0.0% R-sq(adj)} = 0.0% 

Analysis of Variance 

SOURCE DF ss MS 

Regression 1 169121776 169121776 

Error GG 642773352448 1460848230G 

Total G5 642942435328 

Unusual Observations 

Obs. cs Fit Stdev.Fit Residual 
G 9670000 21000 59165 116646 -38165 
40 773000 766336 46961 17916 719375 

R denotes an obs. with a large st. resid. 


St. Resid 
-1.21 X 
6.02R 


X denotes an obs. whose X value gives it large influence. 


MTB > regr cG 1 c6 


The regression equation is 


CG = 32702 + 0.00408 C6 

Predictor voet Stdev teratio 
Constant 32702 27891 Lod 

C6 0.004085 0.006254 0.65 

Ss = 120300 R-sq = 1.0% R-sq(adj) = 0.0% 
Analysis of Variance 

SOURCE — DF S MS 
Regression 1 6175002624 6175002624 

Error GG 636767502336 14471987200 


Total G5 642942500864 


Unusual Observations 
C6 Cc 


Obs. Fit Stdev.Fit 
G4 10100000 21000 73960 45258 
5 13236000 0 86771 63766 
GO 5549000 766336 55370 22096 

R denotes an obs. with a large st. resid. 


X denotes an obs. 
MTB > hist cG 


Histogram of CG N = G6 
Midpcint Count 
0 SO ——_ HR HEHEHE BE HE IE IE IE FE BE IE IE FE FE HE IE DE DE DE FE IE DE DE EE DE DE DE HE 3E IEE EE 
100000 7-H REE 
200000 o xx 
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400000 0 
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700000 0 
800000 l x 


oak 


Residual 
-52960 
-86771 
710966 


St.Resid 
8 X 


=Ve 


-0.85 X 
6.01R 


whose X value gives it large influence. 


MTB > let cl1l0O = sqrt(c4) 
MTB > Rist cl0 


Histogram of C10 N = 46 
aaah Count 
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MTB > regr cl10 1 c5 


The regression equation is 
C10 = 135 +0.000007 CS 


Predictor Coef Stdev t-ratio 
Constant 135.46 27.16 G.98 
C5 0.00000737 0.00001773 0.42 
s = 168.1 R-sq = 0.4% R-sq(adj) = 0.0% 
Analysis of Variance 
SOURCE DE ss MS 
Regression 1 46881 G83l 
Error GG 1243660 4-74-5 
Total G5 4248540 
Unusual Observations 
S. C5 C10 Fit Stdev.Fit Residual 
G 9670000 144.9 206.7 fee 73 -61.8 
G0 773000 875.4 141.1 24.9 734.3 


R denotes an obs. with a large st. resid. 


St.Resid 
-1.40 X 
G.G2R 


X denotes an obs. whose X value gives it large influence. 


MTB > regr cl10 1 c6 


The regression equation is 
C10 = 122 +0.000005 C6 


Predictor Coef Stdev t-ratio 
Constant 121.75 38.89 3.13 
C6 0.00000532 0.00000872 0.61 
S = 167.7 R-sq = 0.8% R-sq(adj) = 0. 
Analysis of Variance 

SOURCE DF Ss MS 
Regression 1 10482 10482 
Error GG 1238058 28138 
Total G5 1248540 


9% 


0% 


Unusual pS a 


Obs. C10 Fit Stdev.Fit Residual St. Resid 
4 10100000 144.9 175.5 63.1 -30.6 -0.20 X 
5 13236000 0.0 192.2 88.9 -192.2 -1.35 X 
40 5549000 875.4 151.3 30.8 724.1 4.39R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > hist cS 
Histogram of C5 N = 46 


Midpoint Count 
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MTB > hist cé 
Histogram of C6 N = 46 
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MTB > let cll =l/c5 
MTB > hist cll 


Histogram of Cll N = 46 
Midpoint Count 
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MTB > let ci2 = 
MTB > hist cl2 


logt(c5) 


Histogram of C12 N = G6 
Midpoint Count 
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MTB > let cl3 = Sqrt(c5) 
MTB > hist ¢cl13 


Histogram of C13 N = G6 
Midpoint Count 
0 1 x 
G00 roe tS SSSS SS SSCS SSS TT TY 
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1600 
2000 0 
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MTB > let c14 = logt(c6) 


MTB > hist c14 
Histogram of C14 N = 46 


Count 
Ll x 


Midpoint 


x 

HE HE HE FE HE HE IE 

HHH 

HH HEH HEHE KE 

HR HEH HEH 

HEH HF IE IE IE IE EEE 
HH EH HH KH 

HR 

x 


& . eo . 8 . 
— 
MNO UISAIGIAIFe OOOO 


“ISI OOO DUI UT UT UT Up 
NOCODANOODHMOWODd 


MTB > let cl15 =sqrt(c6) 
MTB > hist c15 


Histogram of C15 N = 46 
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MTB > regr c4& 1 cl2 


The regression equation is 


C46 = = 162157 + 38306 Cl2 

Predictor Coef Stdev t-ratio 

Constant -162157 183401 -0.88 

Cle 38306 33473 1.14 

s = 119122 R-sq = 2.9% R-sq(adj) = 0.7% 

Anaiysis of Variance 

SOURCE DF Ss MS 

Regression 1 18583924736 185839247356 

Error GG 6246358588416 14189965312 

Total G5 6429425008584 

Unusual Observations 

Obs. Siz C4 Fit Stdev.Fit Residual 
G 6.99 21000 105427 54188 -~84427 
6 4.15 25500 = 3335 47168 28835 
40 5.89 766336 63396 22798 702940 

R denotes an obs. with a large st. resid. 


St.Resid 
-0.80 X 
0.26 X 
6.01R 


X denotes an obs. whose X value gives it large influence. 


MTB > rear ci0 1 ¢l2 


The regression equation is 


C10 = = 272 + 75.5 C12 
Predictor Coef 
Constant -272.9 
Cl2 75.56 
s = 163.5 R-sq = 5.8% 
Analysis of Variance 
SOURCE DF ss 
Regression 1 72308 
Error GG 1176233 
Total G5 1248541 
Unusual Observations 
Obs. C12 C10 
G 6.99 144.9 
6 4.15 159.7 
38 4.97 452.4 1 
40 5.89 875.4 1 


R denotes an obs. with a lar 


tdev t-ratio 
251.7 -1.08 
45.9G 1.64 
R-sq(adj) = 3. d% 
MS 
72308 
26733 
Fit Stdev.Fit Residual 
55.8 74.4 -110.9 
41.3 64.7 118.4 
03.4 32.8 349.0 
72.9 31.3 702.5 
ge st. resid. 


St.Resid 
-0.76 X 
0.79 X 
2.18R 
4.38R 


X denotes an obs. whose X value gives it large influence. 


pO 


The regression equation is 


MTB > regr c4 


CG = 10166 + 21.6 C15 
Predictor Coef 
Constant 10166 
C15 21.59 
s = 119730 R-sq = 1.9% 


Stdev t-ratio 

43442 0.23 

23.42 0.92 
R-sq(adj) = 0.0% 


oe 


Analysis of Variance 


SOURCE DF ss MS 

Regression 1 12185374720 12185374720 

Error GG 630757130240 14335385600 

Total G5 642942500864 

Unusual Observations 

Obs. Gis e) Fit Stdev.Fit Residual 
5 3638 0 88709 48802 

GQ 2356 766336 61021 234790 

R denotes an obs. with a large st. resid. 


X denotes an obs. 


MTB > regr cl1O0 1 cl5 


The regression equation is 
C10 = 80.3 + 0.03535 ¢15 


Predictor Coef 
Constant 80.28 
cis 0.03527 
s = 166.2 R-sq 


Analysis of Variance 


SOURCE DF 
Regression 1 
Error 4G 
Total 45 


Unusual Observations 
15 Cl 


Obs. 

5 3638 Q. 
38 844 G52. 
G0 2356 875. 


R denotes an obs. with a large st. 


Stdev t-ratio 
60.32 1.33 
0.03251 1.08 
= 2.6% R-sq(adj) = 0.4% 
Ss MS 
32526 32526 
1215014 27637 
1248540 
Fit Stdev.Fit Residual 
0 08.6 67.8 -208.6 
& 110.0 37.9 342.G 
be 163.4 32.6 712.9 
resid. 


-838709 
705315 


6.01R 


St.Resid 
a xX 


whose X value gives it large influence. 


St. Resid 
-1.37 X 
2.11R 
4.37R 


X denotes an obs. whose X value gives it large influence. 


MTB > regr cl190 2 cli2 


cis 


The regression equation is 


Ci0 = = 286 + 79.2 C12 - 90.0036 ci5 

Predictor Coef Stdev t-ratio 
Constant -285.7 308.7 -0.93 
Ci2 79.17 65.50 1.21 
cid -0.00356 0.04559 -0.08 

Ss = 165.4 R-sq = 5.8% R-sq(adj)} = 1.4% 


Analysis of Variance 


SOURCE DF 
Regression 2 
Error G3 
Total G5 
SOURCE DF 
C12 1 
cis5 1 


Ss 
72475 
1176066 
1248540 


SEQ SS 
72308 
167 


MS 
36237 
2/7350 


oS 


Unusual Observations 


Obs. Cl2 C10 Fit Stdev.Fit Residual 
4 6.99 144.9 256.0 75.3 -111. 
6 4.15 159.7 37.90 84.9 lac: 
38 4.97 452.4 104.7 37.0 347. 
40 5.89 875.4% 172.1 33.2 703. 


R denotes an obs. with a large st. resid. 


X denotes an obs. whose X value gives it large influence. 


MTB > plot c4& c5 
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MTB > plot c&4 c6 
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MTB >» delete 40 cl-c6 


MTB > delete 38 cl-c6 
MTB >» delete 4 cl-c6 


oy 


10000000 


12500000 


MTB > print cl-cé 


ROW Cl C2 C3 CG C5 
MTB > print cl-cé 
ROW Cl Ce Gs CG C5 
1 1 1166 1308 4380 1166000 
2 2 G03 3470 0 403000 
3 3 1030 1745 5389 1030000 
G 5 1050 13236 0 1050000 
5 6 14 2426 25500 14000 
6 7 391 5454 12191 391000 
7 8 578 7690 67519 578000 
8 9 636 8833 0 636000 
9 10 147 1945 0 147000 
10 ll 352 1774 115896 352000 
11 12 374 2835 188610 374000 
12 13 561 5158 0 561000 
13 14 G8 398 0 48000 
14 15 723 4300 13498 723000 
15 16 1354 7644 3720 1354000 
16 skye 65 957 0 65000 
17 18 368 3575 74675 368000 
18 19 361 3925 8528 361000 
19 20 G27 2107 11180 427000 
20 2l 763 G613 15611 763000 
21 2e 313 5788 88962 313000 
22 23 352 3699 41016 352000 
23 24 GG7 4075 101121 447000 
24 25 580 6692 93000 580000 
25 26 104 6556 0 104000 
26 e7 70 1008 8) 70000 
27 28 73 977 2911 73000 
28 29 1082 3905 45669 1082000 
29 30 312 1669 27676 312000 
30 So 111 651 4450 111000 
31 32 G9G G666 623 494000 
32 33 77 791 6551 77000 
33 34 1494 2634 146034 1494000 
34 35 61 Sil 3048 61000 
35 36 626 G203 5488 626000 
36 37 157 1591 7373 157000 
37 39 213 2066 1370 213000 
38 G41 187 1394 5961 187000 
39 G2 68 Gl 0 68000 
40 G3 80 701 1520 80000 
Gl GG 525 3773 9608 525000 
Ge G45 65 658 1431 65000 
G3 G6 52 522 18503 52000 
MTB > let c20 = sqrt(cG) 
MTB > hist cG 
Histogram of C4 N = 43 
Midpoint Count 
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MTB > hist c20 
Histogram of C20 N = 43 
Midpoint Count 
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MTB > rear c4 1 c¢5 


The regression equation is 
CG = 15351 + 0.0272 C5 


Predictor Coef Stdev t-ratio 
Constant 13351 10079 1 2 

C5 0.02718 0.01772 1.53 

S = 43670 R=-sq = 5.4% R-sq(adj) = 3.1% 
Analysis of Variance 

SOURCE DF Ss MS 
Regression 1 4486594560 4486594560 

Error G1 78189297654 1907055872 

Total G2 32675892224 


Unusual Observations 


Obs. cs CG Fit Stdev.Fit Residual St.Resid 
10 352000 115896 24919 6790 90977 2.11R 
li 374000 188610 25517 6725 163092 3.78R 
15 1354000 3720 52158 17730 ~48438 ~l.el xX 
33 1494000 146034 55963 20051 90071 2.32RX 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


38) 


MTB > regr c20 1 c5 


The regression equation 1s 
C20 = 82.7 +0.000077 CS 


Predictor Coef Stdev t=-ratio 
Constant 82.74 26.69 3-« LO 
cs 0.00007697 0.00004694 1.64 
s = 115.7 R-sq = 6.2% R-sql(adj) = 3 
Analysis of Variance 
SOURCE DF ss MS 
Regression 1 35966 35966 
Error Gi 548496 13378 
Total G2 53844961 
Unusual Observations 
Obs. es C20 Fit Stdev.Fit Re 
10 352000 340.4% 109.8 18.0 

11 374000 434.3 111.5 17.8 

15 1354000 61.0 187.90 47.0 

33 1494000 382.1 197.7 Bs. 1 


R denotes an obs. with a large st. resid. 


9% 


Sidual 
230.6 
322.8 

-126.0 
184.4 


St. Resid 
~ 2.02R 
2.82R 
=a, 19 xX 
1.79 X 


X denotes an obs. whose X value gives it large influence. 


MTB > regr c4 1 c6& 


The rearession equation is 
CG = 21906 + 0.00153 C6 


Predictor Coeff Stdev t-ratio 
Constant 21905 10774 2.03 
C6 0.001529 0.002527 0.61 
S$ = 44706 R-sq = 0.9% R-sq(adj) = 0 
Analysis of Variance 

SOURCE — DF ss MS 
Regression 1 731589632 731589632 
Error G1 81944313856 1998641664 
Total G2 82675892224 


Unusual Observations 
6 CG 


Obs. C Fit Stdev.Fit Re 
4 13236000 0 42143 26014 

10 1774000 115896 24618 7835 

li 2835000 188610 26241 6919 

33 2634000 146034 25933 7023 


R denotes an obs. with a large st. resid. 


0% 


Sidual 
-42143 

91278 
162369 
120101 


X denotes an obs. whose X value gives it large influence. 


MTB > regr c20 1 c6 


The regression equation is 
C20 = 104 #+0.000003 C6 


Predictor Coef Stdev t-ratio 
Constant 104.07 28.68 3.63 
C6 0.00000349 0.00000673 0.52 
Ss = 119.0 R-sq = 0.7% R-sql(adj) = 0 


100 


0% 


Analysis of Variance 


SOURCE DF ss MS 
Regression 1 3816 3816 
Error 41 580646 14162 
Total 42 584461 


Unusual Observations 


Obs. C6 C20 Fit Stdev.Fit Residual St. Resid 
G 13236000 0.0 150.3 69.2 -150.3 -1.55 X 
11 2835000 434.3 114.0 18.4 320.3 2.72R 

33 2634000 382.1 113.3 18.7 268.9 2.29R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > let c21 = sqrt(c5) 
MTB > hist cS 


Histogram of C5 N = 43 
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MTB > hist c2l 
Histogram of C2l N = G3 
Midpoint Count 
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MTB > let c22 = logt(c5) 
MTB > hist c22 


Histogram of C22 N = 43 
Midpoint Count 
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MTB > let c23 = sart(c6) 
MTB > hist c23 


Histogram of C23 N = 43 
sails Count 
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MTB > regr cG4 1 cel 


The regression equation is 
CG = 2792 + G1.1 C21 


Predictor Coef Stdev t-ratio 

Constant 2792 15101 0.18 

C21 41.11 23.11 a hp 33 

S = 43267 R-sq = 7.2% R-sqladj) = 4.9% 

Analysis of Variance 

SOURCE DF Ss MS 

Regression 1 5923262464 5923262464 

Error 41 76752617472 1872014848 

Total G2 82675826688 

Unusual Observations 

Obs. C2l CS Fit Stdev.Fit Residual St.Resid 
10 593 115896 27185 6599 88711 2.907R 
zh 612 188610 27936 6621 160674 3.76R 
33 1222 146034 53046 16084 92988 2.32R 


R denotes an obs. with a large st. resid. 
MTB > regr c20 1 c2il 


The regression equation is 
C20 = G5.5 + 0.119 C2l 


Predictor Coef Stdev t-ratio 
Constant 45.53 39.86 1.15 
C21 0.11922 0.06100 1.95 

S = 114.2 R-sq = 8.5% R-sqladj) = 6.3% 
Analysis of Variance 

SOURCE DF Ss MS 
Regression 1 49805 49805 

Error 41 534656 13040 

Total 42 584461 


Unusual Observations 
Obs. 21 c20 Fit Stdev.Fit Residual St.Resid 
Gal 612 434.3 118.4 17.5 315.9 2.80R 


R denotes an obs. with a large st. resid. 
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MTB > regr c4& 1 c23 


The regression equation is 
C46 = 9661 + 10.4 C23 


Predictor Coef Stdev t-ratio 
Constant 9661 16853 0.57 
C23 10.385 9.275 ile 
S = 446234 R-sq = 3.0% R-sq(adj) = 0.6% 


Analysis of Variance 


SOURCE DF ss MS 

Regression 1 2453259264 2453259264 

Error G1 80222683136 1956650752 

Total 42 82675892224 

Unusual Observations 

Obs. C23 C4 Fit Stdev.Fit Residual St. Resid 
4 3638 0 47445 19504 -~47445 -1.20 X 
10 1332 115896 23494 7420 92402 2.12R 
11 1684 188610 27148 6748 161462 3.69R 
33 1623 146034 26516 6757 119518 2.73R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > regr c20 1 c23 


The regression equation is 
C20 = 67.0 + 0.0292 C23 


Predictor Coef Stdev t-ratio 
Constant 67.01 44.73 1.50 
C23 0.02917 0.02462 1.19 

s = 117.4 R-sq = 3.3% R-sq(adj) = 1.0% 
Analysis of Variance 

SOURCE | DF SS MS 
Regression l 19360 19360 

Error G1 565101 13783 

Total G2 584461 


Unusual Observations 


Obs. C23 C20 Fit Stdev.Fit Residual St.Resid 
4 3638 0.0 173.2 51.8 -173.2 -1.64 X 
10 1332 340.4 105.9 19.7 234.6 2.03R 
11 1684 434.3 116.1 17.9 318.2 2.74R 
33 1623 3382.1 114.4% 17.9 267.8 2.31R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > regr c20 2 c2l c23 


The regression equation is 
C20 = 45.9 + 0.120 C21 - 0.0005 C23 


Predictor Coef Stdev t-ratio 
Constant 45.87 G6.22 0.99 
Cel we 0.11997 0.07949 1.51 
C23 -0.00047 0.03120 -0.01 
s = 115.6 R-sq = 8.5% R-sql(adj) = 3.9% 


Analysis of Variance 


SOURCE DF Ss MS 

Regression 2 49808 24904 

Error 40 534654 13366 

Total 42 584461 

SOURCE DF SEQ SS 

C21 al 49805 

C23 1 3 

Unusual Observations 

Obs. C21 c20 Fit Stdev.Fit Residual St. Resid 
1 1080 66.2 174.9 53.9 -108.7 -1.06 X 
11 612 434.3 118.5 Oy 315.8 2.76R 
33 1222 382.1 191.7 54.2 190.4 1.86 X 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > plot c4 c5 
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MTB > plot c4& c6 
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MTB > delete 33 cl-c6 
MTB > delete 11 cl-c6 
MTB > delete 4 cl-c6 
MTB > print cl-cé6 

C C2 


ROW C3 CG 
i 1 1166 1308 4380 
2 2 403 3470 0 
3 3 1030 1745 5389 
4 6 14 2426 25500 
5 7 391 5454 12191 
6 8 578 7690 67519 
7 9 636 8833 0 
8 10 147 1945 0 
9 11 352 1774 115896 

10 13 561 5158 0 
hak 14 48 398 0 
12 15 723 4300 13498 
is 16 1354 7644 3720 
14 17 65 957 0 
15 18 368 3575 74675 
16 19 361 3925 8528 
T7 20 G27 2107 11180 
18 2l 763 4613 15611 
19 22 313 5788 88962 
20 23 352 3699 41016 
21 24 G47 4075 101121 
22 25 580 6692 93000 
23 26 104 6556 0 
24 27 70 1008 0 
25 28 73 977 2911 
26 29 1082 3905 45669 
oy 30 3i2 1669 27676 
28 31 Lar 651 4450 
29 32 49G G666 623 
30 33 Far 791 6551 
3l 35 61 511 3048 
32 36 626 4203 5488 
33 37 157 1591 7373 
34 39 213 2066 1370 
35 G1 187 1394 5961 
36 G2 68 Gl 0 
37 43 80 701 1520 
38 GG 525 3773 9608 
39 45 65 658 1431 
40 G6 52 522 18503 
MTB > let c30 = sqrt(c4) 
MTB > hist cG 
Histogram of C4 N = 40 
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MTB > hist c30 
Histogram of C30 N = GO 
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MTB > regr cG 1 c5 


The regression equation is 
CG = 16045 + 0.0118 C5 


Predictor Coef Stdev t-ratio 
Constant 16045 7774 2.06 
cs 0.01183 0.01522 0.78 
Ss = 32200 R-sq = 1.6% R-sql(adj) = 0 
Analysis of Variance 

SOURCE DF Ss MS 
Regression l 625817344 625817344 
Error 38 39400640512 1036858880 
Total 39 40026456064 


Unusual Observations 


0% 


Obs. C5 CG Fit Stdev.Fit Residual St. Resid 
1 1166000 4380 29835 12921 -25455 ~0.86 X 
9 352000 115896 20208 STL / 95688 3.01R 
13. 1354000 3720 32058 15592 - 28338 -1.01 X 
19 313000 88962 19747 5211 69215 2.18R 
21 447000 101121 21332 5176 79789 2.51R 
22 580000 93000 22905 5887 70095 2.21R 

R denotes an obs. with a large st. resid. 

X denotes an obs. whose X value gives it large influence. 


MTB > regr c30 1 cS 


The regression equation is 
C30 = 82.1 +0.000056 C5 


Predictor Coef Stdev t-ratio 
Constant 82.08 24.10 3.41 
cS 0.00005643 0.00004719 1.20 
S = 99.83 R-sq = 3.6% R-sq(adj) = 1 


Analysis of Variance 


SOURCE DF ss MS 
Regression 1 14247 14247 
Error 38 378719 9966 
Total 39 392966 
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21% 


Unusual Observations 
C5 C30 


Obs. Fit Stdev.Fit Residual St.Resid 
1 1166000 66.2 47.9 ° -81.7 -0.89 x 
9 352000 340.4% 101.9 15.9 238.5 2.42R 
13. 1354000 61.0 158.5 48.3 -97.5 -1.12 X 
19 313000 298.3 99.7 16.2 198.5 2.02R 
21 447000 318.0 107.3 16.0 210.7 2.14R 
R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 
MTB > regr cG 1 c6 
The regression equation is 
C4 = 8304 + 0.00399 C6 
Predictor Coef Stdev t-ratio 
Constant 8304 8208 1.901 
C6 0.003993 0.002135 1.87 
S$ = 31056 R-sq = 8.4% R-sqladj) = 6.0% 
Analysis of Variance 
SOURCE DF ss MS 
Regression 1 3375790592 3375790592 
Error 38 36650668032 964491264 
Total 39 40026456064 
Unusual Observations 
Obs. C6 Fit Stdev.Fit Residual St.Resid 
7 8833000 0 43577 13222 -43577 -1. X 
9 1774000 115896 15388 5648 100508 3.29R 
21 4075000 101121 24577 5349 76544 2.50R 
R denotes an obs. with a large st. resid. 
X denotes an obs. whcese X value gives it large influence. 
MTB > regr c30 1 c6 
The regression equation is 
C30 = 68.0 +0.000012 C6 
Predictor Coef Stdev t-ratio 
Constant 67.98 25.88 2.63 
C6 0.00001164 0.00000673 levis 
S = 97.91 R=-sq = 7. 37% R-sq(adj) = G.9% 
Analysis of Variance 
SOURCE _ DF Ss MS 
Regression 1 28690 28690 
Error 38 364276 9586 
Total 39 392966 
Unusual Observations 
Obs. C6 C30 Fit Stdev.Fit Residual St.Resid 
7 8833000 0.0 170.8 41.7 -170.8 -1.93 X 
9 1774000 340.4 88.6 17.8 251.8 2.62R 
21 44075000 318.0 115.4 16.9 202.6 2.10R 
R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 
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MTB > let c31 
MTB > hist c5 


Histogram of C5 


Count 
ll 


1 


Midpoint 
0 


200000 
400000 
600000 
800000 
1000000 
1200000 
1400000 


MTB > hist c3l 
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MTB > let c32 = sqrt(c6) 
MTB > hist c6 
Histogram of C6 N = 40 
Midpoint Count 
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MTB > let c33 = logt(cé6) 
MTB > hist c33 


Histogram of C33 N = 40 
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MTB > regr c4 1 c3l 


The regression equation is 
C4 = 7095 + 24.1 C3l 


Predictor Coef Stdev t-ratio 
Constant 7095 11629 0.61 
C31 24.12 18.72 1.29 
S = 31768 R-sq = 4.2% R-sq(adj) = 1.7% 


Analysis of Variance 


SOURCE DF Ss MS 
Regression 1 1675502336 1675502336 
Error 33 38350954496 1009235456 
Total 39 40026456064 


Unusual Observations 


Obs. C31 CG Fit Stdev.Fit Residual St. Resid 
9 593 115896 21406 5061 94490 3.01R 
13 1164 3720 35162 12361 -31442 -1.07 X 
19 559 88962 20590 5023 68372 2.18R 
2l 669 101121 23222 5417 77899 2.49R 
22 762 93000 25465 6280 67535 2.17R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > regr c30 1 c3l 


The regression equation is 
C30 = 49.6 + 0.0968 C31 


Predictor Coef Stdev t-ratio 
Constant 49.64 35.93 1.38 
Cai 0.09676 0.05783 1.67 

S = 98.14 R-sq = 6.9% R-sqladj} = 4.4% 
Analysis of Variance 

SOURCE DF Ss MS 
Regression 1 26960 26960 

Error 38 366006 9632 

Total 39 392966 


aos 


Unusual Observations 


Obs. C31 C30 Fit Stdev.Fit Residual St. Resid 
9 593 340.4 107.0 15.6 233.4 2.41R 
13 1164 61.0 162.2 38.2 -101.2 -1.12 X 
19 559 298.3 103.8 15.5 194.5 2.01R 
21 669 318.0 114.3 16.7 203.7 2.11R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > regr c30 2 c31 c6 


The regression equation is 
C30 = 48.2 + 0.0575 C31 +0.000008 C6 


Predictor Coef Stdev t=-ratio 

Constant 48.21 36.05 1.34 

Csi 0.05750 0.07263 0.79 

C6 0.00000760 0.00000847 0.90 

S = 98.39 R<-sq = 8.8% R-sqladj) = 3.9% 

Analysis of Variance 

SOURCE | DF Ss MS 

Regression 2 34758 17379 

Error 37 358208 9681 

Total 39 392966 

SOURCE DF SEQ SS 

C31 1 26960 

C6 1 7798 

Unusual Observations 

Obs. C31 C30 Fit Stdev.Fit Residual St.Resid 
1 1080 66.2 120.2 50.7 -54.1 -0.6G xX 
9 593 340.4 95.8 20.1 244.6 2.5GR 
oy 669 318.0 117.6 lg 2 200.4 2.07R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > plot c4 c5 
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MTB > plot c4 c6 
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APPENDIX E 


REGRESSION ON VERY COMPLEX PROJECTS 


C1 NUMERICAL LISTING OF THE PROJECTS 
C2 THE AYE COSTS IN THOUSANDS 
C3 THE CONSTRUCTION COSTS IN THOUSANDS 


CG A/ZE LIABILITY COSTS 
CS THE AE COSTS 
C6 THE CONSTRUCTION COSTS 


MTB > let c5 = 1000x%c2 

MTB > let c6 = 1000%c3 

MTB > print cl1-c6 

ROW Cl C2 C3 CG C5 C6 
1 1 8512 15411 10154 8512000 15411000 
2 2 589 6928 0 589000 6928000 
3 3 1347 9912 1616460 1347000 9912000 
G G 766 7947 132298 766000 7947000 
5 5 1349 20566 4950 1349000 20566000 
6 6 10156 G896 24064 10156000 4896000 
7 7 731 5150 88581 731000 5150000 
8 8 1116 14083 55536 1116000 14083000 
i) 9 985 1143 48526 985000 1143000 
10 10 2117 6091 199719 2117000 6091000 
11 11 3433 5399 17520 3433000 5399000 

MTB > regr c4 1 c5 

The regression equation is 

C4 = 274527 - 0.90264 C5 

Predictor Coef Stdev t-ratio 

Constant 274527 198002 1.39 

s = 4690781 R-sq = 3.5% R-sq(adj) = 0.0% 

Analysis of Variance 

SOURCE DF Ss MS 

Regression 1 77711671296 77711671296 

Error 9 2.167797Et12 240866295808 

Total 10 2.245508E+12 

Unusual Observations 

Obs. C5 C4 Fit Stdev.Fit Residual St. 
3 1347000 1616460 238926 163223 1377533 
6 10156000 24064 6106 371728 17958 

R denotes an obs. with a large st. resid. 


X denotes an obs. whose X value gives it large influence. 


MTB > regr cG 1 c6 


The regression equation is 


CG = 184115 + 0.0018 C6 

Predictor Coef Stdev t=-ratio 
Constant 184115 289193 0.64 

C6 0.00177 0.02785 0.06 

S$ = 499389 R-sq = 0.0% R-sq(adj) = 0.0% 
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Analysis of Variance 


SOURCE DF Ss MS 

Regression 1 1006483712 1006483712 

Error 9 2.244502Et12 2469389121536 

Total 10 2.245508E+12 

Unusual Observations 

Obs. C6 C4 Fit Stdev.Fit Residual St. Resid 
3 9912000 1616460 201651 153363 1414808 2.98R 


R denotes an obs. with a large st. resid. 
MTB > hist c4 

Histogram of C4 N= 11 
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MTB > Let cl0 = sart(cG) 
MTB > Rist cl10 


Histogram of C10 N= 11 
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MTB > regr cl10 1 c5 


The regression equation is 
C10 = 368 -0.000024 C5 


Predictor Coef Stdev t-ratio 

Constant 368.0 144.0 2.56 

C5 -0.00002424 0.00003383 -0.72 

Ss = 356.8 R-sq = 5.4% R-sq(adj) = 0.0% 

Analysis of Variance 

SOURCE | DF Ss MS 

Regression 1 65364 65364 

Error 9 1145913 127324 

Total 10 1211277 

Unusual Observations 

Obs. C5 C10 Fit Stdev.Fit Residual St.Resid 
3 1347000 1271 335 119 936 2.78R 
6 10156000 155 122 279 33 0.14 X 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 
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MTB > regr cl10 1 c6 


The regression equation is 
C10 = 338 -0.000004 C6 


Predictor Coef Stdev t-ratio 
Constant 338.5 211.9 1.60 
C6 -0.00000440 0.900002041 -0.22 
Ss = 365.9 R-sq = 0.5% R-sql(adj} = 0.0% 


Analysis of Variance 


SOURCE DF ss MS 
Regression 1 6221 6221 
Error 9 1205056 133895 
Total 10 1211277 


Unusual Observations 

Obs. C6 C10 Fit Stdev.Fit Residual 
3 9912000 1271 295 112 977 

R denotes an obs. with a large st. resid. 

MTB > hist c5 


Histogram of C5 N=11 
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MTB > hist c6 
Histogram of C6 N= 11 
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MTB > let cll = sqrt(c5) 
MTB > hist cll 


Histogram of Cll N= 11 
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MTB > let cl2 = logt(c5) 
MTB > hist cl2 


Histogram of C12 N= 11 
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MTB > let cl3 = 1/7c5 
MTB > hist cl3 


Histogram of C13 N = 11 
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MTB > let c14 = sart(c6) 
MTB > hist c14 


Histogram of C14 N= 11 
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MTB > let cl5 = logt(c6) 
MTB > hist cl15 


Histogram of C15 N= 11 
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MTB > regr c4 1 cll 


The regression equation is 
C4 = 335756 - 92 Cll 


Predictor Coef Stdev t-ratio 
Constant 335756 313153 C7 
Cll -91.9 186.2 -0.49 
S = 492884 R-sq = 2.6% R-sq(adj) = 0.0% 


Analysis of Variance 


SOURCE DF ss MS 

Regression 1 59099402240 59099402240 

Error 9 2.186G09E+12 242934284288 

Total 10 2.245508E+12 

Unusual Observations 

Obs. Cli C4 Fit Stdev.Fit Residual 
3 1161 1616460 229146 160078 1387313 


R denotes an obs. with a large st. resid. 
MTB > regr cl10 1 cll 


The regression equation is 
C10 = 425 - 0.085 Cll 


Predictor Coef Stdev t-ratio 
Constant 425.90 228.2 1.86 
Cll -~0.0848 0.1357 -~0.63 

s = 359.1 R-sq = 4.2% R-sqladj) = 0.0% 
Analysis of Variance 

SOURCE DF ss MS 
Regression 1 50400 50400 

Error 9 1160876 128986 

Total 10 1211276 


Unusual Observations 
Obs. C1i1 C10 Fit Stdev.Fit Residual 
3 1161 1271 327 117 945 


R denotes an obs. with a large st. resid. 
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St. Resid 
2.78R 


MTB > regr c4 1 c15 


The regression equation is 


CG = - 967939 + 170443 C15 

Predictor Coef Stdev t-ratio 
Constant -967939 3196534 -0.30 
Cis 170443 466054 0.37 

Ss = 495830 R-sq = 1.5% R-sq(adj) = 0.0% 


Analysis of Variance 


SOURCE DF SS MS 

Regression 1 32881381376 32881381376 

Error 9 2.212627E+12 245847490560 

Total 10 2.245509E+12 

Unusual Observations 

Obs. C15 CG Fit Stdev.Fit 
3 7.00 1616460 224506 164052 
9 6.06 48526 64611 398745 


R denotes an obs. with a large st. resid. 


Re 


Sidual 


1391954 


~16085 


St. Resid 
2.97R 
-0.05 X 


X denotes an obs. whose X value gives it large influence. 


MTB > regr c10 1 ¢cl15 


The regression equation is 


Ci0 = - 15 + 46 C15 

Predictor Coef Stdev t-ratio 
Constant -15 2363 -0.01 
Ci5 45.9 344.5 0.13 

Ss = 366.5 R-sq = 0.2% R-sq(adj) = 0.0% 
Analysis of Variance 

SOURCE DF ss MS 
Regression 1 2381 2381 

Error 9 1208896 134322 

Total 10 1211277 


Unusual Observations 


Obs. C15 C10 Fit Stdev.Fit 
3 7.900 1271 306 l2l 
9 6.06 220 263 295 


R denotes an obs. with a large st. resid. 


Residual 
965 


~43 


St. Resid 
2.79R 
~Q0.20 X 


X denotes an obs. whose X value gives it large influence. 


MTB > regr c10 2 c5 c6 


The regression equation i 
C10 = 401 -0.000024 C5 +. ” 000004 C6 


Predictor Coef Stdev t-ratio 
Constant 400.6 2diol 1.69 
c5 -0.00002395 0.00003585 -0.67 
C6 -0.00000377 0.00002108 -0.18 
S$ = 377.7 R-sq = 5.8% R-sq(adj) = 0.0% 


dee 


Analysis of Variance 


SOURCE DF ss MS 

Regression 2 69921 34960 

Error 8 1141356 142669 

Total 10 1211277 

SOURCE DF SEQ SS 

C5 1 65364 

C6 1 4557 

Unusual Observations 

Obs. C5 c10 Fit Stdev.Fit Residual St. Resid 
3 1347000 1271 331 128 940 2.65R 


R denotes an obs. with a large st. resid. 


MTB > plot c& cS 


:  @e 
1500000+ 


C4 : 


1000000+ 


500000+ 


= x 
0+ HR % % -- 
0 5000000 4000000 6000000 3000000 10000000 
MTB > plot c4& c6 


: on 
1500000+ 


C4 - 


1000000+ 


500000+ 


= a 
- X *% x% 
0+ x RR 36 > a 


- 
0 4000000 8000000 12000000 16000000 30000000 


has 


MTB > delete 3 


MTB > print cl- 
ROW Cl 
1 1 8 
2 2 
3 G 
G 5 1 
5 6 10 
6 7 
7 8 1 
8 9 
9 10 2 
10 11 3 
MTB > let c20 = 


MTB > hist cG 
Histogram of CG 


een Coun 
20000 
40000 
60000 
80000 
100000 
120000 
140000 
160000 
180000 
200000 


MTB > hist c20 
Histogram of C2 
"alia Coun 
50 

100 

150 


200 
250 


MTB > regr c4& 1 


cl=- c6 
c6 
C2 


433 
sort ( 


XN = 
t 


HOOF OORPRPRF GOIN) 


0 N 
+ 


K 


RFPOPrP RPE NRPRPE 
K KOK OK OK OK OK OK OK 


c5 


C3 


cG4) 


10 


The regression equation is 


C4 = 


Predictor 
Constant 
C5 -0.0 


Ss = 65390 


Coef 
754651 
05820 


R-sq 


Analysis of Variance 


3686365184 


SOURCE DF 
Regression 1 
Error 


Total 


75451 - 0.00582 C5 


8 34206769152 
9 37893132288 


0.006268 


= 9.7% R-sq(adj) = 0.0% 


C5 


8512000 
589000 
766000 

1349000 

10156000 


3433000 


t-ratio 
Ce 71 
-Q. 93 


MS 
3686365184 
4275846144 


119 


C6 


15411000 
6928000 
7947000 

20566000 
4896000 
5150000 

14083000 
1143000 
6091000 
5399000 


Unusual Observations 

Obs. C5 CG Fit Stdev.Fit Residual St.Resid 
9 2117000 199719 63130 21367 136588 2.21R 

R denotes an obs. with a large st. resid. 

MTB > regr c20 1 cS 


The regression equation is 
C20 = 233 -0.000010 C5 


Predictor Coef Stdev t-ratio 
Constant 232.73 60.37 3.86 

c5 -90.00001023 0.00001359 -0.75 

S = 141.8 R-sq = 6.6% R-sq(adj) = 0.0% 
Analysis of Variance 

SOURCE DF Ss MS 
Regression 1 11401 11401 

Error 8 160770 20096 

Total 9 172171 


MTB > regr c& 1 c6 


The regression equation is 
CG = 87153 - 0.00331 C6 


Predictor Coef Stdev t-ratio 
Constant 87153 38202 2.28 

C6 -0.003312 0.003662 -0.90 

S = 65554 R-sq = 9.3% R-sq(adj) = 0.0% 
Analysis of Variance 

SOURCE DF Ss Ms 
Regression 1 3514463488 3514463488 

Error 8 34378670080 4297330688 

Total 9 37893132288 


Unusual Observations 
Obs. c C4 Fit Stdev.Fit Residual St. Resid 
9 6091000 199719 66979 22921 132740 2.16R 
R denotes an obs. with a large st. resid. 

MTB > regr c20 1 c6 


The regression equation is 
C20 = 272 -0.000008 C6 


Predictor Coef Stdev t-ratio 
Constant 271.54 80.37 3.38 

C6 -0.00000791 0.00000770 -1.03 

S$ = 137.9 R-sq = 11.6% R-sqladj) = 0.6% 
Analysis of Variance 

SOURCE — DF ss MS 
Regression l 20022 290022 

Error 8 152149 19019 

Total 9 172171 


vas 


MTB > hist cS 
Histogram of C5 N = 10 


Midpoint Count 
1000000 6G ——- EEE EE 
2000000 1 x 
3000000 1 x 
4000000 0 
5000000 0 
6000000 0 
7000000 0 
8000000 0 
9000000 1 x 

10000000 1 x 


MTB > hist cé6 
Histogram of C6 N = 10 
Midpoint Count 


2000000 1 x 
4000000 1 x 
6000000 G KRXX 
8000000 1 x 
10000000 0 
12000000 0 
14000000 1 x 
16000000 1 x 
18000000 0 
20000000 1 x 


MTB > let c2l = sqrt(cS) 
MTB > hist c2l 


Histogram of C2l N = 10 
Midpoint Count 
800 


0 G RH 
1200 o x 
1600 1 x 
2000 1 x 
2400 0 
2800 1 x 
3200 1 x 


MTB > let c22 = logt(c5) 
MTB > hist c22 


Histogram of C22 N = 10 
Midpoint Count 


5. oO 3 RX 
6.0 o x 
6.2 1 x 
6.4 1 x 
6.6 1 x 
6.8 0 

7.0 o «x 


ia 


MTB > let c23 =1/c5 
MTB > hist c23 


Histogram of C23 N = 10 
Midpoint Count 


0.0000000 1 xX 
0.0000002 C RX 
0.0000004 1 & 
0.0000006 0 
0.0000008 Cc RR 
0.0000010 1 x 
0.00000l12 0 
0.0000014 Cc RR 
0.90000016 1 x 


MTB > let c24 =sqrt(c6) 
MTB > hist c2s4 


Histogram of C24 N = 10 
Midpoint Count 


1000 1 x 
1500 8) 

2000 1 x 
2500 G KKK 
3000 1 x 
3500 0 

G000 oC Xx 
4500 1 x 


MTB > let c25 = logt(c6) 
MTB > hist c25 


Histogram of C25 N = 10 
Midpoint Count 


6.0 1 x 
6.2 0 

6.4 0 

6.6 1 x 
6.8 G XXX 
Tae 1 x 
Tee oC Hx 
7.4 1 x 


MTB > regr c4 1 c2l 


The regression equation is 
C46 = 90066 - 21.1 C2l1 


Predictor Coef Stdev t-ratio 

Constant 90066 43348 2.08 

C21 -21.12 25.13 -0.84 

S = 65973 R-sq = 8.1% R-sq(adj) = 0.0% 

Analysis of Variance 

SOURCE DF ss MS 

Regression 1 39073592064 39073592064 

Error 8 34819543040 46352442368 

Total 9 37893132288 

Unusual Observations 

Obs. C21 C4 Fit Stdev.Fit Residual 
9 1455 199719 59339 20912 140380 


R denotes an obs. with a large st. resid. 


Md 2 


St. Resid 
R 


e 


MTB > r@er.c2z0 1 cal 


The regression equation is 
C20 = 258 =~ 0.0367 C2l 


Predictor Coef Stdev t-ratio 
Constant 257.70 93.76 2.75 
C21 ~0.03665 0.05436 ~0.67 
S$ = 142.7 R-sq = 5.4% R-sqladj) = 0.0% 


Analysis of Variance 


SOURCE DF ss MS 
Regression 1 9258 9258 
Error 8 162913 20364 
Total 9 172171 


MTB > regr cG 1 c25 


The regression equation is 
CG = 261352 - 29724 C25 


Predictor Coef Stdev t-ratio 
Constant 261352 441563 0.59 
C25 -29724 64511 -0.46 
Ss = 67928 R-sq = 2.6% R-sq(adj) = 0.0% 


Analysis of Variance 


SOURCE DF Ss MS 

Regression 1 979619328 979619328 

Error 8 36913516544 4614189056 

Total 9 37893132288 

Unusual Observations 

Obs. C25 CG Fit Stdev.Fit Residual St.Resid 
8 6.06 48526 81280 54633 -32754G ~0.81 X 
9 6.78 199719 59681 21741 140038 2.18R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > regr c20 1 c25 


The regression equation is 
C20 = 838 ~- 93 C25 


Predictor Coef Stdev t-ratio 
Constant 837.7 926.7 0.90 
C25 -92.9 135.4 -0.69 

Ss = 142.6 R-sq = 5.6% R-sq(adj) = 0.0% 
Analysis of Variance 

SOURCE DF Ss MS 
Regression 1 9578 9578 

Error 8 162593 20324 

Total 9 172171 


iS 


X denotes an obs. whose X value gives it large influence. 
MTB > rege c2zZ0 2 cS ce 


The regression equation is 
C20 = 297 -0.000010 C5 -0.000008 C6 


Predictor Coef Stdev t-ratio 
Constant 297.24 90.92 Seed 
cS -0.00000952 0.00001369 -~0.70 
C6 -0.00000760 0.00000798 -0.95 
S = 142.6 R-sq = 17.3% R-sqladj) = 0.0% 
Analysis of Variance 

SOURCE DF ss Ms 
Regression 2g 29867 14933 
Error 7 142304 20329 
Total 9 172171 

SOURCE DF SEQ. S$ 

GS a 11401 

C6 Bf 18466 


MTB > plot c4 c5 


. & 


180000+ 
C4 - - 


120000+ 


- % 
60000+ * 


- x 
- * * 
0+ x x 


0 5000000 4000000 6000000 3000000 10000000 
MTB > plot ¢4 c6 . 
C) #9 
1800004 
C4 : 


1200004 


= x 


60000+ * 


= x 
~ x x 
O+ x Pe 


a 
0 4000000 8000000 12000000 16000000 30000000 
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MTB > delete 9 cl-c6 
MTB > print ci-cS 


ROW Ci C2 C3 CG 
1 1 8512 15411 10154 
2 2 589 6928 
3 G 766 7947 132298 
Q 5 1349 20566 4950 
5 6 10156 4896 24064 
6 7 731 5150 88581 
7 8 1116 14083 55536 
8 9 985 1143 48526 
9 11 3433 5399 17520 
MTB > let ¢30 = sqrt(cG) 
MTB > hist cG 
Histogram of CG N= 9 
Midpoint Count 
0 2 x 
20000 3 XX 
40000 1 x 
60000 1 x 
80000 1 x 
100000 0 
120000 0 
140000 1 x 
MTB > hist c30 
Histogram of C30 N= 9 
Midpoint Count 
0 1 x 
50 1 x 
100 1 x 
150 2 xx 
200 1 x 
250 1 x 
300 1 x 
350 H x 
MTB > regr c4 1 c5 
The regression equation is 
C4 = 56571 - 0.00461 C5 
Predictor Coef Stdev 
Constant 56571 19431 
-0.004614 0.004197 
S = 43622 R-sq = 14.7% 


Analysis of Variance 


C5 


8512000 
589000 
766000 

1349000 

10156000 
731000 

1116000 
985000 

3433000 


t-ratio 
2.91 
-i1,. 10 


R-sq(adj) = 2.5% 


SOURCE DF ss MS 
Regression 1 2299405056 2299405056 
Error 7 13320302592 1902900224 
Total 8 15619706880 


2 


C6 


15411000 
6928000 
7947000 

20566000 
4896000 
5150000 

14083000 
1143000 
5399000 


MTB > regr c30 1 c5 


The regression equation is 
C30 = 200 -0.000008 C5 


Predictor Coef Stdev t-ratio 
Constant 200.14 52.84 3.79 
C5 -0.00000815 0.00001141 -0.71 
Ss = 118.6 R-sq = 6.8% R-sq(adjJ = 0.0% 


Analysis of Variance 


SOURCE DF Ss MS 
Regression 1. 7179 7179 
Error 7 98505 14072 
Total 8 105684 


MTB > regr cG 1 c6 


The regression equation is 
CG = 60926 - 0.00205 C6 


Predictor Coef Stdev t-ratio 
Constant 60986 27635 2.21 
C6 -0.002052 0.002557 =-0.80 
s = 45205 R-sq = 8.4% R-sq(adj) = 0.0% 


Analysis of Variance 


SOURCE DF Ss MS 
Regression 1 1315051520 1315051520 
Error 7 14304653312 2043521792 
Total 8 15619702784 


Unusual Observations 

Obs. C4 Fit Stdev.Fit Residual St.Resid 
3 7947000 132298 44683 15334 87615 2.06R 

R denotes an obs. with a large st. resid. 

MTB > regr c30 1 c6 


The regression equation is 
C30 = 227 -0.000006 C6 


Predictor Coef Stdev t-ratio 
Constant 227.48 Tie S0 3.19 

C6 -0.00000578 0.00000660 -0.88 

s = 116.6 R-sq = 9.9% R-sq(adj) = 0.0% 
Analysis of Variance 

SOURCE DF Ss MS 
Regression 1 10449 10449 

Error 7 95235 13605 

Total 8 105684 


126 


MTB > hist c5 
Histogram of C5 N= 9 


Midpoint Count 
1000000 
2000000 
3000000 
4000000 
5000000 
6000000 
7000000 
8000000 
9000000 

10000000 


MTB > hist c6 
Histogram of C6 N= 9 


HE HE EE EE 


* 


RPrOODOOOCOrFOHO 


Midpoint Count 
2000000 
4000000 
6000000 
8000000 

10000000 

12000000 

14000000 

16000000 

18000000 

20000000 


MTB > let c31 = sart(c5) 
MTB > hist c3l 


Histogram of C31 N= 9 


KMOrRFROORtU HH 


Midpoint Count 


800 G KX 
1200 2 xX 
1600 0 
2000 l x 
2400 0 
2800 l x 
3200 l x 


MTB > let c32 = logt(c5) 
MTB > hist c32 


Histogram of C32 N= 9 
Midpoint Count 


5.8 3 RX 
6.0 a xx 
6.2 l x 
6.4 0 

6.6 1 x 
6.8 0 

7.0 2 x 


Lay 


MTB > let c33 = 17c5 
MTB > hist c33 


Histogram of C33 N= 9 
Midpoint Count 


0.0000000 1 x 
0.0000002 2 x 
0.00000046 0 
0.0000006 0 
0.0000008 oc x 
0.0000010 1 xX 
0.0000012 0 
0.0000014 2 x 
0.0000016 l x 


MTB > let c346 = saqrt(c6) 
MTB > hist c34 


Histogram of C34 N=9 
Midpoint Count 


1000 l x 
1500 0 

2000 1 x 
2500 3 xx 
3000 1 x 
3500 0 

4000 2 Xx 
4500 1 x 


MTB > regr c4 1 c3l 


The regression equation is 
CG = 72507 - 19.8 C31 


Predictor Coef Stdev t-ratio 
Constant 72507 28676 2.53 
C31 -19.83 16.36 -1.21 

S = 42948 R-sq = 17.3% R-sqladj) = 5.5% 
Analysis of Variance 

SOURCE DF ss MS 
Regression 1 2707759616 2707759616 

Error 7 12911967776 18464563968 

Total 8 15619706880 


MTB > regr c30 1 c3l 


The regression equation is 
C30 = 227 - 0.0344 C31 


Predictor Coef Stdev t-ratio 
Constant 227.36 78.81 2.89 
C31 -0.034462 0.044697 -0.77 
S$ = 118.0 R-sq = 7.7% R-sqladj) = 0.0% 


2 


Analysis of Variance 


SOURCE DF Ss MS 
Regression 1 8161 8161 
Error Wi 97524 13932 
Total 8 105684 


MTB > regr c4 1 c34 


The regression equation is 
C46 = 71380 - 10.2 C34 


Predictor Coef Stdev t-ratio 
Constant 71380 45808 1.56 
C34 -10.21 15.22 -0.67 
Ss = 45788 R=-sq = 6.0% R=-sqladj) = 0.0% 


Analysis of Variance 


SOURCE DF ss MS 

Regression l 943696128 943696128 

Error 7 14676008960 2096572672 

Total 8 15619702784 

Unusual Observations 

Obs. C34 C4 Fit Stdev.Fit Residual St. Resid 
3 2819 132298 42594 15265 89704 2.08R 


R denotes an obs. with a large st. resid. 


MTB > regr c30 1 c34 


The regression equation is 
C30 = 265 - 0.0316 C34 


Predictor Coef Stdev t-ratio 
Constant 264.7 117.6 2.25 
C34 -0.03159 0.03906 -0.81 

s = 117.5 R-sq = 8.5% R=-sqladj) = 0.0% 
Analysis of Variance 

SOURCE DF ss MS 
Regression 1 9029 9029 

Error 7 96656 13808 

Total 8 105684 


MTB > regr cG 2 c3l1 c34 


The regression equation is 
C4 = 95595 - 18.9 C31 - 8.6 C34 


Predictor Coef Stdev t-ratio 
Constant 95595 50336 1.90 
C31 -18.88 17.29 -1.09 
C34 -8.64 15.08 -0.57 
s = 45171 R-sq = 21.6% R-sqladj) = 0.0% 


acc, 


Analysis of Variance 


SOURCE 


Regression 


Error 
Total 


SOURCE 
C31 
C34 


DF Ss MS 
2 3377426688 1688713216 
6 12242280448 2040379904 
8 15619706880 

DF SEQ SS 
1 2707759616 
1 669666816 


MTB > plot c4& c5 


120000+ 
C4 = 
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Ot 


MTB > plot c4& c6 
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APPENDIX F 


REGRESSION ON ALL PROJECTS 


Cl NUMERICAL LISTING OF THE PROJECTS 
C2 THE A/”E COSTS IN THOUSANDS 
C3 THE CONSTRUCTION COSTS IN THOUSANDS 
CG A/E LIABILITY COSTS 
C5 THE A/E COSTS 
C6 THE CONSTRUCTION COSTS 
MTB > let c5 =1000x%c2 
MTB > let c6 = 1000x%c3 
MTB > print cl1-c6 
ROW Ci C2 C3 CG C5 
1 1 95 48 0 95000 
2 2 116 584 3520 116000 
3 3 250 6210 0 250000 
G G 166 3180 0 166000 
5 5 13 3524 100000 13000 
6 6 412 6882 100000 412000 
7 7 26 320 30800 26000 
8 8 364 31789 364000 
9 9 234 3664 162001 234000 
10 10 551 7726 89608 551000 
11 11 190 2066 4693 190000 
12 12 488 5451 40826 488000 
13 13 177 1386 17240 177000 
14 14 1869 7984 12568 1869000 
15 15 261 2496 261000 
16 16 G 120 3079 4000 
17 17 415 460G 9331 415000 
18 18 421 2860 20327 421000 
19 19 861 12264 74980 861000 
20 20 417 3607 1007 417000 
21 21 150 1908 25684 150000 
22 22 26 153 1615 26000 
23 23 G 967 73154 4000 
24 24 289 4935 5453 289000 
25 25 1141 8001 123555 1141000 
26 26 421 1737 5159 421000 
27 27 198 847 2757 198000 
28 28 8G 619 2441 84000 
29 29 G09 4567 8913 409000 
30 30 107 85G 19714 107000 
31 31 159 87 0 159000 
32 32 472 1295 63558 472000 
33 33 108 133 2160 108000 
34 34 192 161 7200 192000 
35 1 1166 1308 4380 1166000 
36 2 403 3470 0 403000 
37 3 1030 1745 5389 1030000 
38 G 9670 10100 21000 9670000 
39 5 1050 13236 0 1050000 
GO 6 14 2426 25500 14000 
Gl 7 391 5454 12191 391000 
42 8 578 7690 67519 578000 
43 9 636 8833 0 636000 
GG 10 147 1945 0 147000 
45 11 352 1774 115896 352000 
G6 12 374 2835 188610 374000 
47 13 561 5158 0 561000 
48 14 48 398 0 48000 
G9 15 723 4300 13498 723000 


IPS Al 


C6 


48000 
584000 
6210000 
3180000 
3524000 
6882000 
320000 


31788992 


3664000 
7726000 
2066000 
5451000 
1386000 
7984000 
2496000 

120000 
4604000 
2860000 


12264000 


3607000 
1908000 
153000 
967000 
4935000 
8001000 
1737000 
847000 
619000 
4567000 
854000 
87000 
1295000 
133000 
161000 
1308000 
3470000 
1745000 


10100000 
13236000 


2426000 
5454000 
7690000 
8833000 
1945000 
1774000 
2835000 
5158000 

398000 
4300000 


MTB > let c10 = 


WOO~IGU HGH 


10 
rd 


1349 
10156 
731 
LLL6 
985 
2117 
3433 


Sq 


MTB > hist cG 


Histogram of C4 
Each * represents 2 obs. 


NtGrOs a 


200000 
400000 
600000 
800000 
1000000 
1200000 
1400000 
1600000 


Count 
78 
11 


HK OOOrROO 


6091 
5399 


rt(cG) 


N= 91 


0 
1616460 
132298 
4950 
24064 
88581 
55536 
48526 
199719 
17520 


1354000 


52000 
8512000 
589000 
1347000 
766000 
1349000 


10156000 


731000 
1116000 
985000 
2117000 
3433000 


7644000 
957000 
3575000 
3925000 
2107000 
G613000 
5788000 
3699000 
4075000 
6692000 
6556000 
1008000 
977000 
3905000 
1669000 
651000 
4666000 
791000 
2634000 
511000 
4203000 
1591000 
712000 
2066000 
5549000 
1394000 
41000 
701000 
3773000 
658000 
522000 


15411000 


6928000 
9912000 
7947000 


20566000 


4896000 
5150000 


14083000 


1143000 
6091000 
5399000 


HHH HH HHH HH HEHE HE HEHE HE HE HE HE HE HE HE HE HE HE HE HE HE HE HE HE HE HE HE HE HE HE HE 
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MTB > hist c10 
Histogram of C10 N= 91 
meopoint Count 


VAM TT TTSSSSS TTS TTT Tse TTT sts 


100 Ki TSSTOSCS SSS SS SCSCS SSS SSS SSSSSSSSS sss: 
200 10 RRM KM 
300 13 RRM MMMM RX 
G00 G6 XRRKRX 
500 1 x 
600 0 
700 0 
800 0 
900 1 x 
1000 0 
1100 0 
1200 0 
1300 1 x 


MTB > regr c4 1 c5 


The regression equation is 
C4 = 55884 + 0.0039 C5 


Predictor Coef Stdev t-r 
Constant 55884 21892 
c5 0.00387 0.01176 
Ss = 189558 R-sq = 0.1% R-sqladj) 


Analysis of Variance 


SOURCE DF S MS 
Regression 1 3880218880 3880218880 
Error 89 3.197973Et12 35932282880 
Total 90 3.201853E+12 


Unusual Observations 
OS CG 


S. Fit Stdev.Fit 
38 9670000 21000 93262 106430 
74 773000 766336 58872 19871 
81 8512000 10154 88786 93083 
83 1347000 1616460 61091 20956 
86 10156000 24064 95141 112051 

R denotes an obs. with a large st. resid. 


atio 
2.55 
0.33 


= 0 


Re 


0% 


Sidual 
-72262 
707464 
-78632 


1555369 


-71077 


St. Resid 
-0.46 X 
27 5R 
-0.48 X 
8.26R 
-0.46 X 


X denotes an obs. whose X value gives it large influence. 


MTB > regr ci0 1 c5 


The regression equation is 
C10 = 149 +0.000008 C5 


Predictor Coef Stdev 
Constant 148.75 rede ley AS 
c5 0.00000805 0.00001169 
Ss = 188.3 R-sq = 0.5% 
Analysis of Variance 

SOURCE ~ DF Ss 
Regression 1 16845 

Error 89 3155833 

Total 90 3172678 


t-r 


MS 
16845 
35459 


Miers 


atio 
6.84 
0.69 


R-sq(adj) = 0.0% 


Unusual Observations 
cS 


Obs. C10 Fit Stdev.Fit Residual 
38 9670000 144.9 226.6 105.7 -81.7 
74 773000 875.4 155.0 19.7 720.4 
81 8512000 100.8 27. 92.5 -116.5 
83 1347000 1271.4 159.6 20.8 1111.8 
86 10156000 155.1 230.5 Litas -75.G4 


R denotes an obs. with a large st. resid. 


X denotes an obs. whose X value gives it large influence. 


MTB > regr c4 1 c6 


The regression equation is 


C4 = 35833 + 0.00540 C6 

Predictor Coef Stdev t-ratio 
Constant 35833 26501 1.35 
C6 0.005399 0.004149 1.30 
S = 187894 R-sq = 1.9% R-sq(adj)} = 0 


Analysis of Variance 


SOURCE DF SS MS 
Regression 1 59777138688 59777138688 
ERFor 89 3.1G2077E+12 35304230912 
Total 90 3.201853E+t12 


Unusual dae ald 
C 


Obs. Fit Stdev.Fit Residual 
8 31788992 0 207449 115845 ~-207449 
74 5549000 766336 65790 20395 700546 
81 15411000 10154 119031 50231 -108877 
83 9912000 1616460 89344 30581 1527116 
85 20566000 4950 146861 70407 -141911 


R denotes an obs. with a large st. resid. 


8% 


St. 


Resid 


~1.40 X 


X denotes an obs. whose X value gives it large influence. 


MTB > regr cl10 1 c6 


The regression equation is 


C10 = 134 +0.000005 C6 

Predictor Coef Stdev t-ratio 

Constant 133.56 26.41 5.06 

C6 0.00000503 0.00000413 122 

S$ = 187.3 R-sq = 1.6% R-sq(adj) = 0.5% 

Analysis of Variance 

SOURCE _ DF Ss MS 

Regression i 51855 51855 

Error 89 3120823 35065 

Total 90 3172678 

Unusual Observations 

Obs. C C10 Fit Stdev.Fit Residual 
8 31788992 0.0 293.4 115.5 -293.4% 
74 5549000 875.4% 161.5 20.3 713.9 
81 15411000 100.8 211.0 50.1 -110.3 
83 9912000 1271.4 183.4 30.5 1088.90 
85 20566000 70.4 237.0 70.2 -166.6 

R denotes an obs. with a large st. resid. 


3.84R 


St. Resid 
9 X 


~0.61 X 


X denotes an obs. whose X value gives it large influence. 


134 


5.89R 
0.96 


X 


MTB > hist cS 


Histogram of C5 
Fach * represent 
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MTB > hist c6 
Histogram of C6 


Midpoint Count 
0 36 
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MTB > let cli = 
MTB > let cl2 = 


N= 91 
2 obs. 
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MTB > let c13= 1/c5 


MTB > hist cll 
Histogram of Cll 
Midpoint gouns 
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MTB > Rist cle 
Histogram of Cl2 
Midpoint Count 
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MTB > hist cl13 


Histogram of C13 N= 91 
Each * represents 2 obs. 
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MTB > let cl4 = sqrt(c6) 
MTB > let c1l5 =logt(c6) 
MTB > let cl1l6= 1/7c6 

MTB > hist c14 


Histogram of C14 N= 91 
Midpoint Count 
0 a x 
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MTB > hist ci5 
Histogram of C15 N= 91 


Midpoint Count 
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MTB > hist cl6 


Histogram of C16 N=9] 
Each * represents 2 obs. 
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MTB > regr cG 1 cl2 


The regression equation is 


CG = = 213919 + 49887 Cl2 

Predictor Coef Stdev t-ratio 
Constant -213919 174134 : -1.23 
Cle 49887 31638 1.58 

Ss = 187078 R-sq = 2.7% R-sq(adj) = 1.6% 


Analysis of Variance 


SOURCE DF SS MS 

Regression 1 87012343808 87012343808 

Error 89 3.1148G41E+12 34998214656 

Total 90 3.201853E+1l2 

Unusual Observations 

Obs. Cle CG Fit Stdev.Fit Residual St.Resid 
16 3.60 3079 -~34225 62233 3730G 0.21 X 
23 3.60 73154 -~34225 62233 107379 0.61 X 
38 6.99 21000 134559 51835 -113559 -0.63 X 
7G 5.89 766335 79822 23677 686514 3.70R 
81 6.93 10154 131796 50217 ~121642 -0.67 X 
83 6.13 1616460 91854 28658 1524606 8.25R 
86 7.01 2406G 135622 52459 -111558 -0.62 X 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > regr cl10 1 cl2 


The regression equation is 


C10 = = 211 + 66.9 Cl2 

Predictor Coef Stdev t-ratio 
Constant -211.0 171.3 -1.23 
Cl2 66.9G $1.13 2.15 

s = 184.1 R-sq = 4.9% R-sqladj) = 3.9% 
Analysis of Variance 

SOURCE DF ss MS 
Regression 1 156660 156660 

Error 89 3016018 33888 

Total 90 3172677 


er 


Unusual Observations 


Obs. Cl2 E10 Fit Stdev.Fit Residual 
16 3.60 55.5 $0.1 61.2 25.4 
23 3.60 270.5 30.1 61.2 240.4 
38 6.99 144.9 256.6 51.0 -111.6 
74 5.89 875.4% 183.1 23.3 692.3 
81 6.93 100.8 252.9 49.4% -152.1 
83 6.13 1271.4 199.3 28.2 1072.1 
86 7.01 155.1 258.0 51.6 -102.9 

R denotes an obs. with a large st. resid. 


X denotes an obs. whose X value gives it large influence. 


MT3 > regr ¢c4 1 14 


The regression equation is 


CG = = 4763 + 35.2 C14 
Predictor Coef 
Constant -4763 

C14 35.15 

S$ = 186337 R-sq = 3.5% 


Analysis of Variance 


Stdev t-ratio 
40523 “0.12 
19.690 1.79 


R-sqladj) = 2.4% 


SOURCE F ss Ms 

Regression 1 111652241408 111652241408 

Error 89 3.090202Et12 34721366016 

Total 90 3.201853E+12 

Unusual Observations 

Obs. C14 C4 Fit Stdev.Fit Residual 
8 5638 0 193428 77519 -193428 
74 2356 766336 78041 22259 688295 

83 3148 1616460 105906 32689 1510554 

85 46535 4950 154648 56854 -149698 


R denotes an obs. with a large st. 


resid. 


St.Resid 
=-l. 14 X 

3.72R 

8.23R 
-0.84 X 


X denotes an obs. whose X value gives it large influence. 


MTB > regr cl10 1 c¢14 


The regression equation is 


C10 = 79.0 + 0.0420 C14 
Predictor Coef 
Constant 79.01 

C14 0.04198 0. 
Ss = 184.0 R-sq = 5.0% 


Analysis of Variance 


Stdev t-ratio 
40.02 1.97 
01936 2. le 


R-sq(adj) = 4.0% 


SOURCE DF ss MS 

Regression 1 159240 159240 

Error 89 3013438 33859 

Total 90 3172677 

Unusual Observations 

Obs. 14 C10 Fit Stdev.Fit Residual 
8 5638 0.0 315.7 ; -315. 
74 2356 875.4 177.9 22.0 697.5 
83 3148 1271.4 211.2 32.3 1060.2 
85 4535 70.4% 269.45 56.1 -199.0 

R denotes an obs. with a large st. resid. 


X denotes an obs. whose X value gives it large influence. 


RS) 5 


MTB > regr cl0 2 cl2 cl4 


The regression equation is 


C10 = = 112 + 40.3 Cl2 + 0.0260 C14 

Predictor Coef Stdev t-ratio 
Constant -112.5 196.0 -0.57 
Cl2 40.30 40.38 1.00 
C14 0.02601 0.02512 1.04 

S$ = 184.0 R-sq = 6.1% R-sq(adj) = 3.9% 


Analysis of Variance 


SOURCE DF Ss MS 

Regression 2 192972 96486 

Error 88 2979706 33860 

Total 90 3172677 

SOURCE DF SEQ SS 

Cl2 1 156660 

C14 1 36312 

Unusual Observations 

Obs. Ci2 Cio Fit Stdev.Fit Residual St. Resid 
5 4.11 316.2 102.2 59.0 214.1 1.23 X 
8 5.56 0.0 258.3 95.7 -258.3 -1.64 X 
16 3.60 55.5 G1.7 62.2 13.8 0.08 X 
23 3.60 270.5 58.3 67.90 212.2 1.24 X 
74 5.89 875.4 186.1 23.5 689.3 3./78R 
83 6.13 1271.4 216.4% Se. / 1055.90 5.83R 
85 6.13 70.4% 252.5 58.6 -182.2 -1.04 X 
86 7.01 155.1 227.5 59.G -/2.3 -0.42 X 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > plot cG4 c5 


- Ow 
1500000+ 


C4 = 


1000000+ 


- Om 


500000+ 


— KKK x 


= "2 Osgoacx 1 86 
Ot +4+4+52GK2 % 
a a et ee ae as as as ap an ap & ae ae Gf ff a &e oP ae et ee ee ae ee & a “— an Ge Ge ae ee gs Gs = + = ae == as oe oP ae ae ae ae == = C5 
0 


0 200000 4000000 6000000 8000000 10000000 


RESIS, 


MTB > plot c4& cé 
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C4 - 
1000000+ 
- @* 
500000+ 
- x xX % 
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0+ {1/336 e % xX * * * * 
0 Bae xs baOG0G 18000000 34000000 30000000 
MTB > delete 86 cl-c6 
MTB >» delete 83 cl-c6 
MTB > delete 81 cl-c6 
MTB >» delete 74 cl-c6 
MTB > delete 38 cl-c6 
MTB > print cl-c6 
ROW Cl C2 C3 CG C5 C6 
1 1 95 G48 0 95000 48000 
2 2 116 584 3520 116000 584000 
3 3 250 6210 250000 6210000 
G G 166 3180 166000 3180000 
5 5 13 3524 100000 13000 3524000 
6 6 412 6882 100000 412000 6882000 
7 7 26 320 30800 26000 320000 
8 8 364 31789 364000 31788992 
9 9 234 3664 162001 234000 3664000 
10 10 551 7726 89608 551000 7726000 
11 ll 190 2066 G693 190000 2066000 
le 12 488 5451 40826 488000 5451000 
13 13 177 1386 17240 177000 1386000 
14 14 1869 198G 12568 1869000 7984000 
15 15 261 2496 0 261000 2496000 
16 16 G 120 3079 4000 120000 
17 17 415 4604 9331 415000 4604000 
18 18 G2l 2860 20327 421000 2860000 
19 19 861 12264 74980 861000 12264000 
20 20 417 3607 1007 417000 3607000 
21 21 150 1908 25684 150000 1908000 
22 22 26 153 1615 26000 153000 
23 23 G 967 73154 4000 967000 
24 24 289 4935 5453 289000 4935000 
25 25 1141 8001 123555 1141000 8001000 
26 26 421 1737 5159 421000 1737000 
27 27 198 847 2757 198000 847009 
28 28 8G 619 2441 84000 619000 
29 29 G09 4567 8913 409000 4567000 
30 30 107 854 19714 107000 854000 
31 31 159 87 159000 87000 
32 32 G72 1295 63558 472000 1295000 
33 33 108 133 2160 108000 133000 
3G 34 192 161 7200 192000 161000 
35 1 1166 1308 4380 1166000 1308000 
36 2 403 3470 403000 3470000 
37 3 1030 1745 5389 1030000 1745000 
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38 5 1050 13236 0 1050000 13236000 
39 6 14 2426 25500 14000 2426000 
40 7 391 5454 sped Ma 391000 5454000 
41 8 578 7690 67519 578000 7690000 
42 9 636 8833 0 636000 8833000 
43 10 147 1945 0 147000 1945000 
44 Wal 352 1774 P2696 352000 1774000 
45 re 374 2835 188610 374000 2835000 
46 13 561 5158 0 561000 5158000 
47 14 G8 398 0 48000 398000 
48 15 723 4300 13498 723000 4300000 
G9 16 1354 7644 3720 1354000 7644000 
=) a 65 957 0 65000 957000 
=i 18 368 3575 74675 368000 3575000 
oc 19 361 3925 8528 361000 3925000 


53 20 427 2407 11180 427000 2107000 
54 el 163 4613 1>ou 2 763000 4613000 
Lye 22 313 5788 88962 313000 5788000 
56 23 352 3699 41016 352000 3699000 
Wj 24 447 4075 101121 447000 4075000 
58 25 580 6692 93000 580000 6692000 


59 26 104 6556 0 104000 6556000 
60 27 70 1008 0 70000 1008000 
61 28 73 977 29a 73000 977000 


64 31 ala bal 651 4450 111000 651000 
65 32 494 4666 623 494000 4666000 
66 33 77 791 6551 77000 791000 
67 34 1494 2634 146034 1494000 2634000 
68 35 61 pal 3048 61000 511000 
69 36 626 4203 5488 626000 4203000 
70 37 157 1591 7373 157000 1591000 
re 38 93 a2 204700 93000 712000 
Te 39 213 2066 1370 213000 2066000 
73 41 137 1394 5961 187000 1394000 
74 42 68 41 0 68000 41000 
1S 43 80 701 1520 80000 701000 
76 44 525 3773 9608 525000 3773000 
77 G5 65 555 1431 65000 658000 
78 46 52 Bice 18503 52000 522000 
79 Z 589 6928 0 589000 6928000 
80 4 766 71947 132298 766000 7947000 
81 5 1349 20566 4950 1349000 20566000 
82 7 731 5150 885381 731000 5150000 
83 8 1116 14083 55536 1116000 14083000 
84 9 985 1143 48526 985000 1143000 
85 10 rad bal 7 6091 199719 2117000 6091000 
86 ag 3433 5399 17520 3433000 5399000 


MTB > let c20 = sqrt(c4) 
MTB > hist c4 


Histogram of C4 N = 86 
Hecmatnt Count 
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MTB > hist c20 
Histogram of C20 N = 86 


Midpoint Count 
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MTB > regr cG 1 c5 


The regression equation is 
CG = 24212 + 0.0207 C5 


Predictor Coef Stdev t-ratio 
Constant 24212 7139 3.39 
C5 0.02068 0.01001 2.06 
S$ = 49579 R=-sq = 4.8% R-sqladj) = 3.7% 


Analysis of Variance 


SOURCE DF ss MS 

Regression 1 10478141440 10478141440 

Error 8G 206479884283 24958093824 

Total 85 216957976576 

Unusual Observations 

Obs. C5 CG Fit Stdev.Fit Residual St. Resid 
9 234000 162001 29050 5855 132951 2.70R 
14 1869000 12568 62857 14974 -50289 -1.06 X 
G5 374000 188610 31945 5436 156665 3.18R 
71 93000 204700 26135 6559 178565 3.63R 
85 2117000 199719 67984 17316 131735 2. 84RX 
86 3433000 17520 95195 30128 -/7675 -1.97 X 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > regr c20 1 cS 


The regression equation is 
C20 = 107 +0.000058 C5 


Predictor Coef Stdev t-ratio 
Constant 106.93 17.81 6.00 

C5 0.00005824 0.00002498 2.33 

S$ = 123.7 R-sq = 6.1% R<sqladj) = 5.0% 
Analysis of Variance 

SOURCE DF ss MS 
Regression i 83125 83125 

Error 8G 1284803 15295 


Total 85 1367927 


Tu2 


Unusual Observations 


Obs. cs C20 Fit Stdev.Fit Residual St. Resid 
9 234000 402.5 120.6 14.6 281.9 2.30R 
1G 1869000 Te. 2 215.8 37.4 -103.7 -0.88 X 
45 374000 G34.3 Veo. / 13.6 305.6 2.G9R 
71 93000 G52.G 2.3 16.4 340.1 2.77R 
85 2117000 GG6.9 230.2 G3.2 216.7 Or x 
86 3433000 132.4 306.9 75.2 “174.5 -1.78 X 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > regr cG 1 c6 


The regression equation is 
CG = 30614 + 0.00084 C6 


Predictor Coef Stdev t-ratio 
Constant 30614 7206 G.25 
C6 0.000844 0.001177 0.72 
s = 50667 R-sq = 0.6% R-sq(adj) = 0.0% 


Analysis of Variance 


SOURCE DF ss MS 

Regression 1 1318367744 1318367744 

Error 8G 215639654400 2567138560 

Total 85 216957976576 

Unusual Observations 

Obs. C6 CG Fit Stdev.Fit Residual St.Resid 
8 31788992 0 57433 33182 -57433 “1.50 X 
9 3664000 162001 33705 5477 128296 2.55R 
G5 2835000 188610 33006 5630 155604 3.09R 
67 2634000 146036 32836 5692 113198 2.25R 
71 712000 204700 31215 6688 173485 3.45R 
81 20566000 4950 47965 20267 -43015 -0.93 X 
85 6091000 199719 35753 5998 163966 3.26R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 
MTB > regr c20 1 c6 


The regression equation is 
C20 = 127 +0.000002 Cé 


Predictor Coef Stdev t-ratio 
Constant 127.42 13231 7.04 

C6 0.00000176 0.00000296 0.59 

Ss = 127.3 R-sq = 0.4% R-sqladj) = 0.0% 
Analysis of Variance 

SOURCE — DF Ss MS 
Regression 1 5730 5730 

Error 8G 1362198 16217 

Total 85 1367928 
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Unusual Observations 
C6 C 


Ss. Fit Stdev.Fit Residual St.Resid 
8 31788992 0.0 133.3 83.4 -183.3 “1.91 X 
9 3664000 402.5 133.9 13.8 268.6 2.1eR 
45 2835000 434.3 132.4 14.1 301.9 2.39R 
71 712000 452.4 128.7 16.8 323.8 2.56R 
81 20566000 70.4 163.6 50.9 -93.2 -0.80 X 
85 6091000 GG6.9 138.1 15.1 308.8 2.4GR 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > hist c5 
Histogram of C5 N = 86 
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MTB > hist c6 
Histogram of Cé N = 86 
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MTB > let c2l = logt(c5} 
MTB > hist cel 


Histogram of Cel N = 86 
Midpoint Co 
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MTB > let c22 = 
MTB > hist c22 


Histogram of C22 N = 86 


sqrt(c6) 


Midpoint Count 
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MTB > regr cG 1 cé2l 


The regression equation is 


CG = - 57772 + 16979 C2l 

Predictor Coef Stdev t-ratio 
Constant -57772 51989 -l.1ll 
C21 16979 9569 eer 7 

s = 49895 R-sq = 3.6% R-sq(adj) = 2.5% 
Analysis of Variance 

SOURCE DF Ss MS 
Regression 1 7837642752 7837642752 

error 8G 209120395264 2489528320 


Total 85 216957976576 


Unusual Observations 


Obs. Cal CG 
5 4.11 100000 12079 
9 5.37 162001 33392 
16 3.60 3079 3387 
23 3.60 73154 3387 
39 4.15 25500 12625 
G5 =. 57 188610 36850 
67 6.17 146034 47062 
71 G.97 204700 26588 
SS 6.33 199719 49632 


R denotes an obs. with a large st. 


Fit Stdev.Fit 
13465 


5390 


18061 
138061 
13183 


resid. 


Residual 


178112 
150087 


St. Resid 
X 


X denotes an obs. whose X value gives it large influence. 


MTB > regr c20 1 cé2l 


The regression equation is 


C20 = = 120 + 47.1 C21 

Predictor Coef Stdev 
Constant -120.0 130.90 
C2l 47.09 23.93 
s = 124.8 R-Ssq = 4.4% 
Analysis of Variance 

SOURCE — DF Ss 
Regression l 60280 

Error 84 1307648 

Total 85 1367928 


t-ratio 


MS 


602380 
15567 


L45 


0.92 
1.97 


R-sq(adj) = 3.3% 


Unusual Observations 


Obs. C21 C20 Fit Stdev.Fit Residual St.Resid 
5 4.11 316.2 73.7 33. 242. 2.02RX 
9 5.37 402.5 132.8 13.5 269.7 2.17R 
16 3.60 55.5 49.6 45.2 5. 0.05 X 
23 3.60 270.5 49.6 45.2 220.9 1.90 X 
39 4.15 159.7 75.2 33.0 84.5 0.70 X 
G45 5.57 4634.3 142.4% 14.0 291.9 2.35R 
71 4.97 452.4% 113.9 17.0 338.5 2.74R 
85 6.33 446.9 1 FF ao: 25.8 269.1 2.20R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > regr c4 1 c22 


The regression equation is 
CG = 18256 + 9.01 C22 


Predictor Coef Stdev t-ratio 
Constant 18256 11078 1.65 
C22 9.015 5.547 1.63 
s = 50041 R-sq = 3.0% R-sqladj) = 1.9% 


Analysis of Variance 


SOURCE DF Ss MS 

Regression 1 6613970944 6613970944 

Error 84 210344083456 2504096000 

Total 85 216958042112 

Unusual Observations 

Obs. C22 C4 Fit Stdev.Fit Residual St. Resid 
8 5638 0 69082 22263 -69082 -1.54 X 
9 1914 162001 35511 5478 126490 2.-54R 

45 1684 188610 33434 5406 155176 3.12R 
67 1623 146034 32886 5438 113148 2.27R 
71 844 204700 25862 7352 178838 3.61R 

81 4535 4950 59137 16394 -54187 -1.15 X 
85 2468 199719 40504 6726 159215 3.21R 

R denotes an obs. with a large st. resid. 


X denotes an obs. whose X value gives it large influence. 


MTB > regr c20 1 ce22 


The regression equation is 
C20 = 93.3 + 0.0236 C22 


Predictor Coef Stdev t-ratio 
Constant 93.33 27.78 3.36 
C22 0.02357 0.01391 1.69 
Ss = 125.5 R-sq = 3.3% R-sq(adj) = 2.2% 


Analysis of Variance 


SOURCE DF Ss MS 
Regression 1 45206 45206 
Error 84 l32272e2 15747 
Total 85 1367928 


L46 


Unusual aes 
C 


Obs. 2 20 Fit Stdev.Fit Residual St.Resid 
8 5638 0.0 : 55.8 -226.2 -~2.01RX 
9 1914 402.5 138.4 13.7 264.1 2.12R 
45 1684 434.3 133.0 13.6 301.3 2.42R 
67 1623 382.1 131.6 13.6 250.6 2.01R 
71 84G 452.4 113.2 18.4 339.2 2.73R 
81 4535 70.4% 200.2 41.1 -129.9 -1.10 X 
85 2468 446.9 151.5 16.9 295.4 2.38R 
R denotes an obs. with a large st. resid. 


X denotes an obs. whose X value gives it large influence. 
MTB > regr c20 2 c5 c2e2 


The regression equation is 
C20 = 92.8 +0.000049 C5 + 0.0106 C22 


Predictor Coef Stdev t=-ratio 

Constant 92.84 27.47 3.38 

C5 0.00004890 0.00002863 lear gal 

C22 0.01060 0.01571 0.67 

s = 124.1 R-sq = 6.6% R-sq(adj) = 4.3% 

Analysis of Variance 

SOURCE DF ss MS 

Regression 2 90137 45069 

Error 83 1277790 15395 

Total 85 1367927 

SOURCE DF SEQ SS 

C5 1 83125 

C22 1 7013 

Unusual Observations 

Obs. C5 C20 Fit Stdev.Fit Residual St.Resid 
8 364000 0.0 170.4 64.1 -170.4% ~1.60 X 
9 234000 402.5 124.6 15.8 277.9 2.26R 
45 374000 434.3 129.0 13.6 305.3 2.48R 
71 93000 452.4 106.3 18.7 346.1 2.82R 
81 1349000 70.4 206.9 40.8 -136.5 -~1.17 X 
85 2117000 446.9 222.5 44.8 224.4% 1.94 X 
86 3433000 132.4% 285.4% 81.8 -153.0 -1.64 X 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > plot c4& c5 
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MTB > plot cG c6 


210000+ x 


= % 
CG - x 
= % 
140000+ % 
= 2 
o % 
= % 2 
- xx x 
70000+ 2 % % * 
= x 
= % RN X 
- * 2% 
~ ¥OX¥RKRCSO x 
0+ repeat sh lai % % ao eae 8 
Siete teetentantentente etententertentertententents Hontetetetetetetetens tee tetetenteteteeds Teketeteteteteees += --C6 
0 By acins ieoddbeo eGaso0 0 meni 30000000 
MTB > delete 86 cl-c6 
MTB > delete 81 cl-c6 
MTB > delete 71 cl-c6 
MTB > delete G5 cl1-c6 
MTB >» delete 9 cl-c6 
MTB > delete 8 cl-c6 
MTB > praint cl-c6 
ROW Cl C2 C3 CG C5 C6 
l l 95 G8 0 95000 48000 
2 2 116 584 3520 116000 584000 
3 3 250 6210 0 250000 6210000 
3 G 166 3180 0 166000 3180000 
5 5 13 3524 100000 13000 3524000 
6 6 Gl2 6882 100000 412000 6882000 
7 7 26 320 30800 26000 320000 
8 10 551 7726 89608 551000 7726000 
9 il 190 2066 4693 190000 2066000 
10 12 488 5451 G0826 488000 5451000 
ii 13 iv 1386 17240 177000 1386000 
12 14 1869 798G 12568 1869000 7984000 
13 15 261 2496 0 261000 2496000 
14 16 G 120 3079 4000 120000 
15 17 Gi5 4604 9331 415000 G604000 
16 18 Gel 2860 20327 421000 2860000 
17 19 861 12264 74980 861000 12264000 
18 20 417 3607 1007 417000 3607000 
19 2l 150 1908 25684 150000 1908000 
20 22 26 153 1615 26000 153000 
21 23 G 967 73154 G000 967000 
22 24 289 4935 5453 289000 4935000 
23 25 1141 8001 123555 1141000 8001000 
24 26 421 1737 5159 421000 1737000 
25 27 198 847 2/757 198000 847000 
26 28 8G 619 2441 84000 619000 
27 29 G09 4567 8913 409000 4567000 
28 30 107 854 19714 107000 854000 
29 Su 159 87 159000 87000 
30 32 G72 1295 63558 472000 1295000 
31 33 108 133 2160 108000 133000 
32 3G 192 161 7200 192000 161000 
33 1 1166 1308 4380 1166000 1308000 
34 2 G03 3470 403000 3470000 
35 3 1030 1745 5389 1030000 1745000 
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10 
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MTB > hist c4 
Histogram of C4 N = 80 


Midpoint Count 
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MTB > let c30 = sqrt(c4) 
MTB > hist c30 


Histogram of C30 N = 80 


Midpoint Count 
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MTB > regr cG 1 cS 


The regression equation is 
C4 = 9302 + 0.0460 C5 


Predictor Coef Stdev t-ratio 
Constant 9302 6053 1.54 

cS 0.046016 0.009926 4.64 

Ss = 37977 R-sq = 21.6% R-sqladj) = 20.6% 
Analysis of Variance 

SOURCE DF Ss MS 
Regression 1 30999834624 30999834624 

Error 78 112496345088 14422607356 

Total 79 143496118272 


Unusual Observations 
C 


Obs. c Fit Stdev.Fit Residual St.Resid 
5 13000 100000 9900 5962 90100 2.G40R 
12 1869000 12568 95306 14856 ~82738 -2.37RX 
42 352000 115896 25499 4325 90396 2.40R 
64 1494000 146034 78050 11339 67984 1.88 X 
76 766000 132298 44550 5370 87748 2-335R 
80 2117000 199719 106718 17229 93000 2. /5RX 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > regr ¢c30 1 c5 


The regression equation is 
C30 = 76.5 +0.000114 C5 


Predictor Coef Stdev t-ratio 
Constant 76.48 17.01 4.50 

c5 0.00011362 0.00002789 4.07 

s = 106.7 R-sq = 17.5% R-sq(adj) = 16.5% 
Analysis of Variance 

SOURCE DF Ss MS 
Regression 1 188978 188978 

Error 78 888001 11385 

Total 79 1076978 


dl. 


Unusual Observations 


Obs. cS C30 Fit Stdev.Fit Residual St. Resid 
5 13000 316.2 78.0 16.8 238.3 2.26R 
l2 1869000 112.1 288.8 41.7 -176.7 -1.80 X 
42 352000 340.4 116.5 12.2 224.0 2-11R 
64 1494000 382.1 246.2 31.9 135.9 1.33 X 
80 2117000 GG6.9 317.0 48.4 129.9 1.37 X 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 
MTB > regr c4 1 c6 


The regression equation is 
C4 = 13045 + 0.00468 C6 


Predictor Coef Stdev t-ratio 
Constant 13045 6767 1.93 
cé6 0.004678 0.001453 3.22 
S = 40297 R-sq = 11.7% Resq(adj) = 10.6% 


Analysis of Variance 


SOURCE DF ss MS 

Regressicn 1 16838246400 16838246400 

Error 78 126657953792 1623819776 

Total 79 143496183808 

Unusual Observations 

Obs. CS C4 Fit Stdev.Fit Residual St. Resid 
17 12264000 74980 70420 13539 4560 - X 
36 13236000 0 74967 14878 -74967 -2.00RX 
G2 1774000 115896 21345 5139 94551 2.37R 
64 2634000 146034 25368 4668 120666 3.01R 
76 7947000 132298 50224 7905 82074 2.08R 
78 14083000 55536 78929 16056 =-23393 -0.63 X 
80 6091000 199719 41541 589G 158178 3.97R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > regr ¢30 1 ¢c6 


The regression equation is 
C30 = 84.4 +0.000012 C6 


Predictor Coef Stdev t-ratio 
Constant 84.38 18.70 4.51 

C6 0.00001194 0.00000401 2.97 

s = 111.4 R-sq = 10.2% R-sq(adj) = 9.0% 
Analysis of Variance 

SOURCE DF ss MS 
Regression l 109626 109626 

Error 78 967352 12402 

Total 79 1076978 


Unusual Observations 


Obs. C30 Fit Stdev.Fit Residual St.Resid 
17 12264000 273.8 230.8 37.4 , -41 
36 13236000 0.0 242.4 41.1 -242.G -2. 3GRX 
G2 1774000 340.G 105.6 14.2 234.9 2.13R 
6G 2634000 382.1 115.8 12.9 266.3 2.41R 
78 14083000 235.7 252.5 GG.G -16.8 -0.16 X 
80 6091000 GG6.9 157.1 16.3 289.8 2.63R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > hist c5 


Histogram of C5 N = 80 
Midpoint Count 
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MTB > hist c6 
Histogram of CS N = 80 
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MTB > let c3l =logt(c5)} 
MTB > hist c3l 


Histogram of C3l N = 80 
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MTB > let c32 = sqrt(cé) 
MTB > hist c32 


Histogram of C32 N = 80 
Midpoint Count 
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MTB > regr c& 1 c3l 


The regression equation is 


C6 = - 88913 + 21964 C31 

Predictor Coef Stdev t-ratio 
Constant -88913 44333 -2.01 
C31 21964 8192 2.68 

S$ = 41042 R-sq = 8.4% R-sqladj) = 7.3% 


Analysis of Variance 


SOURCE DF Ss MS 
Regression 1 12107689984 12107689984 
Error 78 131383473344 1684467456 
Total 79 163496118272 


Unusual Observations 


Obs. C31 C4 Fit Stdev.Fit Residual St. Resid 
5 4.11 100000 1445 11359 98554 2.50RX 
14 3.60 3079 -~9798 15290 128377 0.34 X 
2l 3.60 73154 -9798 15290 82951 2. 18RX 
42 5.55 115896 32911 4782 82985 2.0GR 
64 6.17 146034 46700 7947 99334 2.47R 
76 5.88 132298 40328 6161 91970 2.27R 
80 6.33 199719 50025 8988 149696 3.74R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > regr c30 1 c3l 


The regression equation is 


C30 = - 181 + 57.1 C31 

Predictor Coef Stdev t-ratio 
Constant -181.3 122.0 -1.49 
C31 57.08 22.55 2.53 

s = 113.0 R-sq = 7.6% R-sqladj) = 6.4% 
Analysis of Variance 

SOURCE DF Ss MS 
Regression l 81764 81764 

Error 78 995215 12759 

Total 79 1076978 


sy 


Unusual Observations 


Obs. C31 C30 Fit Stdev.Fit Residual St.Resid 
5 G.11 316.2 53.5 31.3 262.8 2- 42RX 
14 3.60 55.5 24.3 42.1 $1.2 0.30 X 
el 3.60 2790.5 24.3 G2.l1 246.2 2. 35RX 
80 6.33 446.9 179.7 24.7 267.2 2.-42R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > regr cG 1 ¢32 


The regression equation is 


CG = - 1993 + 18.8 C32 

Predictor Coef Stdev t-ratio 
Constant =-1993 9845 =-0.20 

C32 12. a7 5.281 3.56 

S = 39776 R-sq = 14.0% R-sql(adj) = 12.9% 


Analysis of Variance 


SOURCE DF ss MS 

Regression 1 20089384960 20089384960 

error 78 123406778368 1582138112 

Total 79 143496118272 

Unusual Observations 

Obs. C32 CG Fit Stdev.Fit Residual St.Resid 
36 3638 0 66467 11338 -~66467 -1.74 X 
42 1332 115896 23070 4779 92826 2.35R 
64 1623 146034 28547 4452 117487 2.97R 
76 2819 132298 51054 7552 81244 2.98R 
78 3753 55536 68624 11897 -13088 -0.34 X 
80 2468 199719 44G4G8 6151 155271 3.95R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large influence. 


MTB > regr ¢c30 1 c32 


The regression equation is 
C30 = 43.0 + 0.0498 C32 


Predictor Coef Stdev t-ratio 
Constant G3.05 27.12 1.59 
C32 0.04980 0.01455 3.42 
Ss = 109.6 R-sq = 13.1% R-sqladj) = 11.9% 


Analysis of Variance 


SOURCE DF Ss MS 
Regression l 140675 140675 
Error 78 936304 12004 
Total 79 1076978 


Unusual Observations 


Obs. C32 C30 Fit Stdev.Fit Residual St.Resid 
36 3638 0.0 224.2 31.2 -224.2 -2.13RX 
Ge 1332 340.4 109.4 13.2 231.1 2.12R 
64 1623 382.1 123.9 12.3 258.3 2.37R 
78 3753 235.7 229.9 32.8 5.7 0.05 X 
80 2468 G46.9 165.9 16.9 281.0 2.60R 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X value gives it large infiuence. 
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MTB > regr c& 2 c5 c32 


The regression equat 
C4 = 688 + 0.0372 C5 


Predictor Coef 
Constant 688 
cs 0.03717 
C32 7.491 
Ss = 37875 R=-sq 


Analysis of Variance 


ion is 


+ 7.49 C32 
Stdev t-ratio 
9417 0.07 
0.01237 3.00 
6.285 1.19 
= 23.0% R=-sq(adj) = 21.0% 


SOURCE DF ss MS 

Regression 2 33037799424 16518897664 

Error 77 110458372096 1434524160 

Total 79 143496118272 

SOURCE DF SEQ SS 

c5 Ll 30999834624 

C32 1 2037964800 

Unusual Observations 

Obs. C5 CG Fit Stdev.Fit Residual St.Resid 
5 13000 100000 15233 7442 84767 2.28R 
12 1869000 12568 91326 15188 -78758 -2.27RX 
42 352000 115896 23749 4556 92147 2.45R 
6G 1494000 146034 68378 13919 77656 2. 20RX 
76 766000 132298 50278 7196 82020 2.2l1R 
80 2117000 199719 97865 18720 101854 3.09RX 


R denotes an obs. with a large st. resid. 
X denotes an obs. whose X vaiue gives it large influence. 


MTB > plot c4 c5 
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MTB > plot c4& c6 
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APPENDIX 6 


CHART OF COMPLEXITY 


Ranges 
project 0 O- $10,000- $50,000- $100,000- $500,000 
type $10,000 $50,000 $100,000 $500,000 & above 
NOT COMPLEX 
* of projects in range 6 a 9 ° 5 Q 
% of * in range to total NC projects i at! 32.4 26:3 14.7 8.8 Q 
& of NC projects to total in range Ca! so 42.3 41.7 30.0 0 
COMPLEX 
* of projects in range 10 14 12 4 = 1 
”% of * in range to total C projects 21.7 30.4 26.1 8.7 10.9 Ze 
® of C projects to total in range 08.8 95.9 90.0 $5;5 90.0 90.0 
VERY COMPLEX 
* of projects in range | | > 3 2 | 
” of * in range to total VC projects 9.1 9.1 27.5 21S 18.2 9.1 
% of VC projects to total in range 0.9 38 12.5 20.0 20.0 90.0 
total * of projects in range 17 26 24 12 10 2 
% of * inrange to all 91 projects 18.7 28.6 26.4 15.2 11.0 22 
% Of % With A/E % With AZE liability 
total lability of total 
(1) Z) 1x2 

NC 34/91 = 37.3% 28/34 = 82.3% 30.7 

C 46/91 = 50.6% 36/46 = 78.3% 39.6 

VC 11/91 = 12.1% 10/711 = 90.9% 11.0 
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